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Preface 


The fundamental physical and mathematical concepts which underlie 
the path integral approach to quantum mechanics were first developed 
by R. P. Feynman in the course of his graduate studies at Princeton, 
although more fully developed ideas, such as those described in this 
volume, were not worked out until a few years later. These early 
inquiries were involved with the problem of the infinite self-energy 
of the electron. In working on that problem, a “least-action’”’ prin- 
. ciple using half advanced and half retarded potentials was discovered. 
The principle could deal successfully with the infinity arising in the 
application of classical electrodynamics. 

The problem then became one of applying this action principle to 
quantum mechanics in such a way that classical mechanics could 
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arise naturally as a special case of quantum mechanics when h was 
allowed to go to zero. 

Feynman searched for any ideas which might have been previously 
worked out in connecting quantum-mechanical behavior with such 
classical ideas as the lagrangian or, in particular, Hamilton’s principle 
function S, the indefinite integral of the lagrangian. During some 
conversations with a visiting European physicist, Feynman learned 
of a paper in which Dirac had suggested that the exponential function 
of ze times the lagrangian was analogous to a transformation function 
for the quantum-mechanical wave function in that the wave function 
at one moment could be related to the wave function at the next 
moment (a time interval e later) by multiplying with such an expo- 
nential function. 

The question that then arose was what Dirac had meant by the 
phrase “analogous to,” and Feynman determined to find out whether 
or not it would be possible to substitute the phrase “equal to.” A 
brief analysis showed that indeed this exponential function could be 
used in this manner directly. 

Further analysis then led to the use of the exponent of the time 
integral of the lagrangian, S (in this volume referred to as the action) 
as the transformation function for finite time intervals. However, 
in the application of this function it is necessary to carry out integrals 
over all space variables at every instant of time. 

In preparing an article! describing this idea, the idea of “integral 
over all paths” was developed as a way of both describing and evalu- 
ating the required integrations over space coordinates. By this time 
a number of mathematical devices had been developed for applying 
the path integral technique and a number of special applications had 
been worked out, although the primary direction of work at this time 
was toward quantum electrodynamics. Actually, the path integral 
did not then provide, nor has it since provided, a truly satisfactory 
method of avoiding the divergence difficulties of quantum electro- 
dynamics, but it has been found to be most useful in solving other 
problems in that field. In particular, it provides an expression for 
quantum-electrodynamic laws in a form that relativistic invariance 
is obvious. In addition, useful applications to other problems of 
quantum mechanics have been found. 

The most dramatic early application of the path integral method 
to an intractable quantum-mechanical problem followed shortly after 
the discovery of the Lamb shift and the subsequent theoretical diffi- 


1 R. P. Feynman, Space-Time Approach to Non-relativistic Quantum Mechanics, 
Rev. Mod. Phys., vol. 20, p. 367, 1948. 
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culties in explaining this shift without obviously artificial means of 
getting rid of divergent integrals. The path integral approach pro- 
vided one way of handling these awkward infinities in a logical and 
consistent manner. 

The path integral approach was used as a technique for teaching 
quantum mechanics for a few years at the California Institute of 
Technology. It was during this period that A. R. Hibbs, a student 
of Feynman’s, began to develop a set of notes suitable for converting 
a lecture course on the path integral approach to quantum mechanics 
into a book on the same subject. 

Over the succeeding years, as the book itself was elaborated, other 
subjects were brought into both the lectures of Dr. Feynman and the 
book; examples are statistical mechanics and the variational principle. 
At the same time, Dr. Feynman’s approach to teaching the subject 
of quantum mechanics evolved somewhat away from the initial path 
integral approach. At the present time, it appears that the operator 
technique is both deeper and more powerful for the solution of more 
general quantum-mechanical problems. Nevertheless, the path inte- 
gral approach provides an intuitive appreciation of quantum-mechan- 
ical behavior which is extremely valuable in gaining an intuitive 
appreciation of quantum-mechanical laws. For this reason, in those 
fields of quantum mechanics where the path integral approach turns 
out to be particularly useful, most of which are described in this book, 
the physics student is provided with an excellent grasp of basic quan- 
tum-mechanical principles which will permit him to be more effective 
in solving problems in broader areas of theoretical physics. 


R. P. Feynman 
A. R. Hibbs 
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‘The Fundamental 
Concepts of 
Quantum Mechanics 


1-1 PROBABILITY IN QUANTUM MECHANICS! 


From about the beginning of the twentieth century experimental 
physics amassed an impressive array of strange phenomena which 
demonstrated the inadequacy of classical physics. The attempts to 
discover a theoretical structure for the new phenomena led at first to 
a confusion in which it appeared that light, and electrons, behaved 
sometimes like waves and sometimes like particles. This apparent 
inconsistency was completely resolved in 1926 and 1927-in the theory 
called quantum mechanics. The new theory asserts that there are 
experiments for which the exact outcome is fundamentally unpredict- 
able and that in these cases one has to be satisfied with computing 
probabilities of various outcomes. But far more fundamental was the 
discovery that in nature the laws of combining probabilities were not 
those of the classical probability theory of Laplace. The quantum- 
mechanical laws of the physical world approach very closely the laws 
of Laplace as the size of the objects involved in the experiments 
increases. Therefore, the laws of probabilities which are convention- 
ally applied are quite satisfactory in analyzing the behavior of the 
roulette wheel but not the behavior of a single electron or a photon 
of light. 


A Conceptual Experiment. The concept of probability is not 
altered in quantum mechanics. When we say the probability of a 
certain outcome of an experiment is p, we mean the conventional 
thing, i.e., that if the experiment is repeated many times, one expects 
that the fraction of those which give the outcome in question is 
roughly p. We shall not be at all concerned with analyzing or defin- 
ing this concept in more detail; for no departure from the concept 
used in classical statistics is required. 

What is changed, and changed radically, is the method of calculating 
probabilities. The effect of this change is greatest when dealing with 
objects of atomic dimensions. For this reason we shall illustrate the 
laws of quantum mechanics by describing the results to be expected 
in some conceptual experiments dealing with a single electron. 

Our imaginary experiment is illustrated in Fig. 1-1. At A we have 
a source of electrons S. The electrons at S all have the same energy 
but come out in all directions to impinge on a screen B. The screen 


1 Much of the material appearing in this chapter was originally presented as a 
lecture by R. P. Feynman and published as The Concept of Probability in Quantum 
Mechanics in the Second Berkeley Symposium on Mathematical Statistics and 
Probability, University of California Press, Berkeley, Calif., 1951. 
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Fig. 1-1 The experimental arrangement. Electrons emitted at A make their way to the 
detector at screen C, but a screen B with two holes is interposed. The detector registers a 
count for each electron which arrives; the fraction which arrives when the detector is 
placed at a distance x from the center of the screen is measured and plotted against x, as in 
Fig. 1-2. 


B has two holes, 1 and 2, through which the electrons may pass. 
Finally, behind the screen B at a plane C we have a detector of elec- 
trons which may be placed at various distances x from the center of 
the screen. 

If the detector is extremely sensitive (as a Geiger counter is) it will 
be discovered that the current arriving at x is not continuous, but 
corresponds to a rain of particles. If the intensity of the source S is 
very low, the detector will record pulses representing the arrival of 
individual particles, separated by gaps in time during which nothing 
arrives. This is the reason we say electrons are particles. If we had 
detectors simultaneously all over the screen C, with a very weak source 
S, only one detector would respond, then after a little time, another 
would record the arrival of an electron, etc. There would never be a 
half response of the detector; either an entire electron would arrive 
or nothing would happen. And two detectors would never respond 
simultaneously (except for the coincidence that the source emitted 
two electrons within the resolving time of the detectors—a coincidence 
whose probability can be decreased by further decrease of the source 
intensity). In other words, the detector of Fig. 1-1 records the pas- 
sage of a single corpuscular entity traveling from S through a hole in 
screen B to the point 7. 

This particular experiment has never been done in just this way. 
In the following description we are stating what the results would be 
according to the laws which fit every experiment of this type which 
has ever been performed. Some experiments which directly illus- 
trate the conclusions we are reaching here have been done, but such 
experiments are usually more complicated. We prefer, for pedagogical 
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reasons, to select experiments which are the simplest in principle and 
disregard the difficulties of actually doing them. 

Incidentally, if one prefers, one could just as well use light instead 
of electrons in this experiment. The same points would be illustrated. 
The source S could be a source of monochromatic light and the sensi- 
tive detector a photoelectric cell or, better, a photomultiplier which 
would record pulses, each being the arrival of a single photon. 

What we shall measure for various positions x of the detector is the 
mean number of pulses per second. In other words, we shall deter- 
mine experimentally the (relative) probability P that the electron 
passes from S to x, as a function of z. 

The graph of this probability as a function of x is the complicated 
curve illustrated qualitatively in Fig. 1-2a. It has several maxima 
and minima, and there are locations near the center of the screen at 
which electrons hardly ever arrive. It is the problem of physics to 
discover the laws governing the structure of this curve. 

We might at first suppose (since the electrons behave as particles) 
that 


I. Each electron which passes from S to x must go through either 
hole 1 or hole 2. As a consequence of I we expect that: 
II. The chance of arrival at x should be the sum of two parts, Pı, 
the chance of arrival coming through hole 1, plus Pe, the chance of 
arrival coming through hole 2. 


We may find out if this is true by direct experiment. Each of the 
component probabilities is easy to determine. We simply close hole 


(a) (b) (c) (0) 


Fig. 1-2 Results of the experiment. Probability of arrival of electrons at x plotted 
against the position x of the detector. The result of the experiment of Fig. 1-1 is plotted 
here at (a). If only one hole is open, so the electrons can go through just hole 1, the result 
is (b). For just hole 2 open, it is (c). If we imagine each electron just goes through one 


hole or the other, we expect the curve (d) = (b) + (c) when both holes are open. This is 
considerably different from what we actually get, (a). 


The fundamental concepts of quantum mechanics 


5 


E 


Fig. 1-3 An analogous experiment in wave interference. The complicated curve P(z) in 
Fig. 1-2a is the same as the intensity I(x) of waves which would arrive at x starting from S 
and coming through the holes. At some points z the wavelets from holes 1 and 2 interfere 
destructively (e.g., a crest from hole 1 arrives at the same time as a trough from hole 2) sat 
others, constructively. This produces the complicated minima and maxima of the curve 


2 and measure the chance of arrival at x with only hole 1 open. This 
gives the chance P, of arrival at x for electrons coming through 1. 
The result is given in Fig. 1-2b. Similarly, by closing 1 we find the 
chance P» of arrival through hole 2 (Fig. 1-2c). 

The sum of these (Fig. 1-2d) clearly does not agree with the curve (a). 


. Hence, experiment tells us definitely that P عد‎ Pı + Ps, or that II 


is false. 


The Probability Amplitude. The chance of arrival at x with 
both holes open is not the sum of the chance with just hole 1 open 
plus that with just hole 2 open. 

Actually, the complicated curve P(x) is familiar, inasmuch as it is 
exactly the intensity of distribution in the interference pattern to be 
expected if waves starting from S pass through the two holes and 
impinge on the screen C (Fig. 1-3). The easiest way to represent 
wave amplitudes is by complex numbers. We can state the correct 
law for P(x) mathematically by saying that P(x) is the absolute square 
of a certain complex quantity (if electron spin is taken into account, 
it is a hypercomplex quantity) (x) which we call the probability ampli- 
tude of arrival at x. Furthermore, (x) is the sum of two contribu- 
tions: ررم‎ the amplitude of arrival through hole 1, plus روف‎ the ampli- 
tude of arrival through hole 2. In other words, 


III. There are complex numbers ¢; and ¢» such that 
p= dit ول‎ (1-2) 


I(x). 
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and 
Py = |¢y|? P = |¢:|? (1-3) 


In later chapters we shall discuss in detail the actual calculation of 
رف‎ and ¢». Here we say only that ¢1, for example, may be calculated 
as a solution of a wave equation representing waves spreading from 
the source to 1 and from 1 to z. This reflects the wave properties of 
electrons (or in the case of light, photons). l 

To summarize: We compute the intensity (i.e., the absolute square 
of the amplitude) of waves which would arrive in the apparatus at نك‎ 
and then interpret this intensity as the probability that a particle will 
arrive at xv. 


Logical Difficulties. What is remarkable is that this dual use of 
wave and particle ideas does not lead to contradictions. This is so 
only if great care is taken as to what kind of statements one is per- 
mitted to make about the experimental situation. 

To discuss this point in more detail, consider first the situation which 
arises from the observation that our new law III of composition of 
probabilities implies, in general, that it is not true that P = Pı + Pœ». 
We must conclude that when both holes are open, it is not true that 
the particle goes through one hole or the other. For if it had to go 
through one or the other, we could classify all the arrivals at x into 
two disjoint classes, namely, those arriving via hole 1 and those arriv- 
ing through hole 2; and the frequency P of arrival at v would surely 
be the sum of the frequency 21 of particles coming through hole 1 and 
the frequency و2‎ of those coming through hole 2. 

To extricate ourselves from the logical difficulties introduced by this 
startling conclusion, we might try various artifices. We might say, 
for example, that perhaps the electron travels in a complex trajectory 
going through hole 1, then back through hole 2 and finally out through 
1 in some complicated manner. Or perhaps the electron spreads out 
somehow and passes partly through both holes so as to eventually pro- 
duce the interference result III. Or perhaps the chance P, that the 
electron passes through hole 1 has not been determined correctly 
inasmuch as closing hole 2 might have influenced the motion near 
hole 1. Many such classical mechanisms have been tried to explain 
the result. When light photons are used (in which case.the same law 
III applies), the two interfering paths 1 and 2 can be made to be many 
centimeters apart (in space), so that the two alternative trajectories 
must almost certainly be independent. That the actual situation is 
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more profound than might at first be supposed is shown by the follow- 
ing experiment. 


The Effect of Observation. We have concluded on logical 
grounds that since P ¥ Pı + بوط‎ it is not true that we can expect to 
analyze the electron’s motion by the simple assumption that the elec- 
tron passes through either hole 1 or hole 2. But it is easy to design 
an experiment to test our conclusion directly. We have merely to 
have a source of light behind the holes and watch to see through which 
hole the electron passes (see Fig. 1-4). For electrons scatter light, so 
that if light is scattered behind hole 1, we may conclude that an elec- 
tron passed through hole 1; and if it is scattered in the neighborhood 
of hole 2, the electron has passed through hole 2. 

The result of this experiment is to show unequivocally that the elec- 
tron does pass through either hole 1 or hole 2! That is, for every elec- 
tron which arrives at the screen C (assuming the light is strong enough 
that we do not miss seeing it) light is scattered either behind hole 1 
or behind hole 2, and never (if the source S is very weak) at both places. 
(A more delicate experiment could even show that the charge passing 
through the holes passes through either one or the other and is in all 
cases the complete charge of one electron and not a fraction of it.) 

It now appears that we have come to a paradox. For suppose that 
we combine the two experiments. We watch to see through which 
hole the electron passes and at the same time measure the chance that 
the electron arrives at x. Then for each electron which arrives at x 
we can say experimentally whether it came through hole 1 or hole 2. 
First we may verify that رط‎ is given by curve (b), because if we 
select, of the electrons which arrive at x, only those which appear to 
come through hole 1 (by scattering light there), we find they are, 


Fig. 1-4 A modification of the 
experiment of Fig. 1-1. Here 
we place a light source L behind 
the screen B and look for light 
scattered by the electrons pass- 
ing through hole 1 or hole 2. 
With a strong light source every 
electron is indeed found to pass 
by one or the other hole. But 
now the probability of arrival 
at x is no longer given by the 
curve of Fig. 1-2a, but is instead 
given by Fig. 1-2d. 
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indeed, very nearly distributed asin curve (b). (This result is obtained 
whether hole 2 is open or closed, so we have verified that there is no 
subtle influence of closing 2 on the motion near hole 1.) If we select 
the electrons scattering light at 2, we get (very nearly) و2‎ of Fig. 1-2. 
But now each electron appears at either 1 or 2 and we can separate 
our electrons into disjoint classes. So, if we take both together, we 
must get the distribution P = P, + وم‎ illustrated in Fig. 1-20. And 
experimentally we do! Somehow now the distribution does not show 
the interference effects III of curve (a)! 

What has been changed? When we watch the electrons to see 
through which hole they pass, we obtain the result P = Pı + .وم‎ 
When we do not watch, we get a different result, 


P = |¢1 + ¢:|? + Pi + P: 


Just by watching the electrons, we have changed the chance that 
they arrive at x. How is this possible? The answer is that, to watch 
them, we used light and the light in collision with the electron may 
be expected to alter its motion, or, more exactly, to alter its chance 
of arrival at z. 

On the other hand, can we not use weaker light and thus expect a 
weaker effect? A negligible disturbance certainly cannot be presumed 
to produce the finite change in distribution from (a) to (d). But weak 
light does not mean a weaker disturbance. Light comes in photons 
of energy hv, where v is the frequency, or of momentum 72/2, where A 
is the wavelength. Weakening the light just means using fewer pho- 
tons, so that we may miss seeing an electron. But when we do see 
one, it means a complete photon was scattered and a finite momentum 
of order A/A is given to the electron. 

The electrons that we miss seeing are distributed according to the 
interference law (a), while those we do see and which therefore have 
scattered a photon arrive at x with the probability P = Pı + Ps,in (d). 
The net distribution in this case is therefore the weighted mean of (a) 
and (d). In strong light when nearly all electrons scatter light, it is 
nearly (d); and in very weak light, when very few scatter, it becomes 
more like (a). 

It might still be suggested that since the momentum carried by the 
light is 7/2, weaker effects could be produced by using light of longer 
wavelength A. But there is a limit to this. If light of too long a 
wavelength is used, we shall not be able to tell whether it was scattered 
from behind hole 1 or hole 2; for the source of light of wavelength \ 
cannot be located in space with precision greater than order ^. 

We thus see that any physical agency designed to determine through 
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which hole the electron passes must produce, lest we have a paradox, 
enough disturbance to alter the distribution from (a) to (d). 

It was first noticed by Heisenberg, and stated in his uncertainty 
principle, that the consistency of the then-new mechanics required a 
limitation to the subtlety to which experiments could be performed. 
In our case the principle says that an attempt to design apparatus to 
determine through what hole the electron passed, and delicate enough 
so as not to deflect the electron sufficiently to destroy the interference 
pattern, must fail. It is clear that the consistency of quantum 
mechanics requires that it must be a general statement involving all 
the agencies of the physical world which might be used to determine 
through which hole an electron passes. The world cannot be half 
quantum-mechanical, half classical No exception to the uncertainty 
principle has been discovered. 


THE UNCERTAINTY PRINCIPLE 


We shall state the uncertainty principle as follows: Any determination 
of the alternative taken by a process capable of following more than 
one alternative destroys the interference between alternatives. 
Heisenberg’s original statement of the uncertainty principle was not 
given in the form we have used here. We shall interrupt our argu- 
ment for a few paragraphs to discuss Heisenberg’s original statement. 

In classical physics a particle can be described as moving along a 
definite trajectory and having, for example, a precise position and 
velocity at any particular time. Such a picture would not lead to the 
odd results that we have seen are characteristic of quantum mechanics. 
Heisenberg’s uncertainty principle gives the limits of accuracy of such 
classical ideas. For example, the idea that a particle has both a defi- 
nite position and a definite momentum has its limitations. A real 
system (i.e., one obeying quantum mechanics) looked upon from a 
classical view appears to be one in which the position or momentum 
is not definite, but is uncertain. The uncertainty in position can be 
reduced by careful measurement, and other measurements may make 
the momentum definite. But, as Heisenberg stated in his principle, 
both cannot be accurately known simultaneously; the product of the 
uncertainties of momentum and position involved in any experiment 
cannot be smaller than a number with the order of magnitude of 1 
That such a result is required by physical consistency in the situation 


th = h/2r = 1,054 X 10-87 erg-sec, where h = Planck’s constant, 
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we have been discussing can be shown by considering still another way 
of trying to determine through which hole the electron passes. 


Example. Notice that if an electron is deflected in passing through 
one of the holes, its vertical component of momentum is changed. 
Furthermore, an electron arriving at the detector ند‎ after passing 
through hole 1 is deflected by a different amount, and thus suffers a 
different change in momentum, than an electron arriving at x via 
hole 2. Suppose that the screen at B is not rigidly supported, but is 
free to move up and down (Fig. 1-5). Any change in the vertical 
component of the momentum of an electron upon passing through a 
hole will be accompanied by an equal and opposite change in the 
momentum of the screen. This change in momentum can be meas- 
ured by measuring the velocity of the screen before and after the pas- 
sage of an electron. Call 6p the difference in momentum change 
between electrons passing through hole 1 or hole 2. Then an unam- 
biguous determination of the hole used by a particular electron requires 
a momentum determination of the screen to an accuracy of better 
than dp. 

If the experiment is set up in such a way that the momentum of 
screen B can be measured to the required accuracy, then, since we can 
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Fig. 1-5 Another modification of the experiment of Fig. 1-1. The screen B is left free to 
move vertically. If the electron passes hole 2 and arrives at the detector (at x = 0, for 
example), it is deflected upward and the screen B should recoil downward. The hole 
through which the electron passes can be determined for each passage by starting with the 
screen at rest and measuring whether it is recoiling up or down afterward. According to 
Heisenberg’s uncertainty principle, however, such precise momentum measurements on 
screen B would be inconsistent with accurate knowledge of its vertical position, so we could 
not be sure that the center line of the holes is correctly set. Instead of P (x) of Fig. 1-2a, we 
get this smeared a little in the vertical direction, so it looks like Fig. 1-2d. 
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determine the hole passed through, we must find that the resulting 
distribution of electrons is that of curve (d) of Fig. 1-2. The inter- 
ference pattern of curve (a) must be lost. How can this happen? 
To understand, note that the construction of a distribution curve in 
the plane C requires an accurate knowledge of the vertical position 
of the two holes in screen B. Thus we must measure not only the 
momentum of screen B but also its position. If the interference pat- 
tern of curve (a) is to be established, the vertical position of B must 
be known to an accuracy of better than d/2, where d is the spacing 
between maxima of the curve (a). For suppose the vertical position 
of B is not known to this accuracy; then the vertical position of every 
point in Fig. 1-2a cannot be specified with an accuracy greater than 
d/2, since the zero point of the vertical scale must be lined up with 
some nominal zero point on B. Then the value of P at any particular 
height x must be obtained by averaging over all values within a dis- 
tance d/2 of x. Clearly, the interference pattern will be smeared out 
by this averaging process. The resulting curve will look like Fig. 1-2d. 

The interference pattern in the original experiment is the sign of a 
wave-like behavior of the electrons. The pattern is the same for any 
wave motion, so we may use the well-known result from the theory of 
light diffraction that the relation between the separation a of the holes, 
the distance / between screen B and the plane C, the wavelength à of 
the light, and d is 


a‏ ا 


(1-4) 


Ql > 


as shown in Fig. 1-6. In Chap. 3 we shall find that the wavelength 
of the electron waves is intimately connected with the momentum of 
the electron by the relation 


)1-5( - م 


If p is the total momentum of an electron (and we assume all the elec- 
trons have the same total momentum), then for 1 >> a 
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as shown in Fig. 1-7. It follows that 
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Fig. 1-6 Two beams of light, starting in phase at holes 1 and 2, will interfere construc- 
tively when they reach the screen C if they take the same time to travel from Bto C. This 
means that a maximum in the diffraction pattern for light beams passing through two holes 
will occur at the center of the screen. As we move down the screen, the next maximum will 
occur at a distance d, which is far enough from the center that, in traveling to this point, 
the beam from hole 1 will have traveled exactly one wavelength à farther than the beam 
from hole 2. 


® 
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Fig. 1-7 The deflection of an electron in passing through a hole in the screen B is actually 
a change in momentum ôp. This change amounts to the addition of a small component of 
momentum in a direction approximately perpendicular to the original momentum vector. 
The change in energy is completely negligible. For small deflection angles, the total 
momentum vector keeps the same magnitude (approximately). Then the deflection angle 
is represented to a very good approximation by |ép|/|p|. If two electrons, one starting 
from hole 1 with momentum p; and the other starting from hole 2 with momentum p», 
reach the same point on the screen C, then the angles through which they were deflected 
must differ by approximately a/l. Since we cannot say through which hole an electron 
has come, the uncertainty in the vertical component of momentum which the electron 
receives on passing through the screen B must be equivalent to this uncertainty in deflection 
angle. This gives relation |p: — p2|/|p| = |êp|/|p| = a/l. 
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Since experimentally we find that the interference pattern has been 
lost, it must be that the uncertainty ôx in the measurement of the 
position of B is larger than d/2. Thus 


h 
bp bx > 5 (1-8) 


which agrees (in order of magnitude) with the usual statement of the 
uncertainty principle. 

A similar analysis can be applied to the previous measuring device 
where the scattering of light was used to determine through which 
hole the electron passed. Such an analysis produces the same lower 
limit for the uncertainties of measurement. 

The uncertainty principle is not “proved” by considering a few such 
experiments. It is only illustrated. The evidence for it is of two 
kinds. First, no one has yet found any experimental way to defeat 
the limitations in measurements which it implies. Second, the laws 
of quantum mechanics seem to require it if their consistency is to be 
maintained, and the prediction of these laws has been confirmed again 
and again with great precision. 


INTERFERING ALTERNATIVES 


Two Kinds of Alternatives. From a physical standpoint the two 
routes are independent alternatives, yet the implication that the prob- 
ability is the sum Pı + P2is false. This means that either the premise 
or the reasoning which leads to such a conclusion must be false. Since 
our habits of thought are very strong, many physicists find that it is 
much more convenient to deny the premise than to deny the reasoning. 
To avoid the logical inconsistencies into which it is so easy to stumble, 
they take the following view: When no attempt is made to determine 
through which hole the electron passes, one cannot say it must pass 
through one hole or the other. Only in a situation where apparatus 
is operating to determine through which hole the electron goes is it 
permissible to say that it passes through one or the other. When you 
watch, you find that it goes through either one hole or the other hole; 
but if you are not looking, you cannot say that it goes either one way 
or the other! Nature demands that we walk a logical tightrope if we 
wish to describe her. 

Contrary to that way of thinking, we shall in this book follow the 
suggestion made in the first part of this chapter and deny the reason- 
ing; i.e., we shall not compute probabilities by adding probabilities for 
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all alternatives. In order to make definite the new rules for combining 
probabilities, it will be convenient to define two meanings for the word 
“alternative.” The first of these meanings carries with it the concept 
of exclusion. Thus holes 1 and 2 are exclusive alternatives if one of 
them is closed or if some apparatus that can unambiguously determine 
which hole is used is operating. The other meaning of the word 
“alternative” carries with it a concept of combination or interference. 
(The term interference has the same meaning here as it has in optics, 
i.e., either constructive or destructive interference.) Thus we shall 
say, holes 1 and 2 present interfering alternatives to the electron when 
(1) both holes are open and (2) no attempt is made to determine 
through which hole the electron passes. When the alternatives are 
of this interfering type, the laws of probability must be changed to 
the form given in Eqs. (1-1) and (1-2). 

The concept of interfering alternatives is fundamental to all of 
quantum mechanics. In some situations we may have both kinds of 
alternatives present. Suppose we ask, in the two-hole experiment, 
for the probability that the electron arrives at some point, say, within 
1 cm of the center of the screen. We may mean by that the proba- 
bility that if there were counters arranged all over the screen (so one 
or another would go off when the electron arrived), the counter 
which went off was within 1 em of z = 0. Here the various possi- 
bilities are that the electron arrives at some counter via some hole. 
The holes represent interfering alternatives, but the counters are exclu- 
sive alternatives. Thus we first add $1 + ¢» for a fixed x, square that, 
and then sum these resultant probabilities over x from —1 to +1 cm. 

It is not hard, with a little experience, to tell which kind of alterna- 
tives is involved. For example, suppose that information about the 
alternatives is available (or could be made available without altering 
the result), but this information is not used. Nevertheless, in this 
case a sum of probabilities (in the ordinary sense) must be carried out 
over exclusive alternatives. These exclusive alternatives are those 
which could have been separately identified by the information. 


Some Illustrations. When alternatives cannot possibly be 
resolved by any experiment, they always interfere. A striking illus- 
tration of this is the scattering of two nuclei at 90°, say, in the center- 
of-gravity system, as illustrated in Fig. 1-8. Suppose A represents 
an a particle and B some other nucleus. Ask for the probability that 
A is scattered to position 1 and B to 2. The amplitude is, say, 
gaz(1,2). The probability of this is p = |¢4x(1,2)|?. Suppose we do 
not distinguish what kind of nucleus arrives at 1, that is, whether it 
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de 1 Fig. 1-8 Scattering of one 
nucleus by another in the center- 


of-gravity system. The scat- 
tering of two identical nuclei 
shows striking interference effects. 
There are two interfering alterna- 
tives here. The particle which 
8 arrives at 1, say, can have been 
that which started either from A 
or from B. If the original nuclei 
were not identical, tests of 
identity at 1 could determine 
which alternative had actually 
been taken, so they are exclusive 
alternatives and the special inter- 
—- ference effects do not arise in 
2 this case. 


is Bor A. If itis B, the amplitude is ¢42(2,1) [which equals ¢42(1,2), 
because we have taken a 90° angle]. The chance that some nucleus 
arrives at 1 and the other at 2 is 


|#an(1,2)|? + |¢40(2,1)|? = 2p (1-9) 


We have added the probabilities. The cases A arrives at 1 and B 
arrives at 1 are exclusive alternatives because we could, if we wished, 
determine the character of the nucleus at 1 without disturbing the 
previous scattering process. 

But if A is an a particle, what happens if B is also an a particle? 
Then no experiment can distinguish which is which, and we cannot 
know when one arrives at 1 whether itis A or B. We have interfering 
alternatives, and the probability is 


4p (1-10)‏ = ?|)42(2,1 + )1,2(ۍ#4| 


This interesting result is readily verified experimentally. 

If electrons scatter electrons, the result is different in two ways. 
First, the electron has a quality we call spin, and a given electron may 
be in one of two states called spin up and spin down. The spin is not 
changed to first approximation for scattering at low energy. The spin 
carries a magnetic moment. At low velocities the main forces are 
electrical, owing to charge, and the magnetic influences make only a 
small correction, which we neglect. So if electron A has spin up and 
B has spin down, we could later tell which arrived at 1 by measuring 
its spin. If up, it is A; if down, it is B. The scattering probability 
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is then 
|4(1,2)| + |6(2,1)|? = 2p (1-11) 


in this case. 
If, however, both A and B start with spin up, we cannot later tell 
which is which and we would expect 


|@an(1,2) T pag(2,1)|? =: 4p (1-12) 


Actually this is wrong and, remarkably, electrons obey a different rule. 
The amplitude for an event in which the identity of a pair of electrons 
is reversed contributes 180° out of phase. That is, the case of both 
spin up gives 


|ean(1,2) — dan(2,1)|? (1-13) 


In our case of 90° scattering ¢42(1,2) = ¢42(2,1), so this is zero. 


Fermions and Bosons. This rule of the 180° phase shift for alter- 
natives involving exchange in identity of electrons is very odd, and 
its ultimate reason in nature is still only imperfectly understood. 
Other particles besides electrons obey it. Such particles are called 
fermions, and are said to obey Fermi, or antisymmetric, statistics. 
Electrons, protons, neutrons, neutrinos, and » mesons are fermions. 
So are compounds of an odd number of these such as a nitrogen atom, 
which contains seven electrons, seven protons, and seven neutrons. 
This 180° rule was first stated by Pauli and is the full quantum- 
mechanical basis of his exclusion principle, which controls the char- 
acter of the chemists’ periodic table. 

Particles for which interchange does not alter the phase are called 
bosons and are said to obey Bose, or symmetrical, statistics. Exam- 
ples of bosons are photons, 7 mesons, and systems containing an even 
number of Fermi particles such as an a particle, which is two protons 
and two neutrons. All particles are either one or the other, bosons 
or fermions. These interference properties can have profound and 
mysterious effects. For example, helium liquid made of atoms of 
atomic mass 4 (bosons) at temperatures of one or two degrees Kelvin 
can flow without any resistance through small tubes, whereas the 
liquid made of atoms of mass 3 (fermions) does not have this property. 

The concept of identity of particles is far more complete and defi- 
nite in quantum mechanics than it is in classical mechanics. Classi- 
cally, two particles which seem identical could be nearly identical, or 
identical for all practical purposes, in the sense that they may be so 
closely equal that present experimental techniques cannot detect any 
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difference. However, the door is left open for some future technique 
to establish the difference. In quantum mechanics, however, the 
situation is different. We can give a direct test to determine whether 
or not particles are completely indistinguishable. 

If the particles in the experiment diagramed in Fig. 1-8, starting 
at A and B, were only approximately identical, then improvements 
in experimental techniques would enable us to determine by close 
scrutiny of the particle arriving at 1, for example, whether it came 
from A or B. In this situation the alternatives of the two initial 
positions must be exclusive, and there must be no interference between 
the amplitudes describing these alternatives. Now the important 
point is that this act of scrutiny would take place after the scattering 
had taken place. This means that the observation could not possibly 
affect the scattering process, and this in turn implies that we would 
expect no interference between the amplitudes describing the alterna- 
tives (that it is either the particle from A or the particle from B which 
arrives at 1). In this case we must conclude from the uncertainty 
principle that there is no way, even in principle, to ever distinguish 
between these possibilities. That is, when a particle arrives at 1, it 
is completely impossible by any test whatsoever, now or in the future, 
to determine whether the particle started from A or B. In this more 
rigorous sense of identity, all electrons are identical, as are all pro- 
tons, etc. 

As a second example we consider the scattering of neutrons from a 
crystal. When neutrons of wavelength somewhat shorter than the 
atomic spacing are scattered from the atoms in a crystal, we get very 
strong interference effects. The neutrons emerge only in certain dis- 
crete directions determined by the Bragg law of reflection, just as for 
X rays. The interfering alternatives which enter this example are 
the alternative possibilities that it is this, or that, atom which does the 
scattering of a particular neutron. (The amplitude to scatter neu- 
trons from any atom is so small that we need not consider alternatives 
in which a neutron is scattered by more than one atom.) The waves 
of amplitude describing the motion of a neutron which start from these 
atoms interfere constructively only in certain definite directions. 

Now there is an interesting complication which enters this appar- 
ently simple picture. Neutrons, like electrons, carry a spin, which 
can be analyzed in two states, spin up and spin down. Suppose the 
scattering material is composed of an atomic species which has a simi- 
lar spin property, such as carbon-13. In this case an experiment will 
reveal two apparently different types of scattering. It is found that 
besides the scattering in discrete directions, as described in the pre- 
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ceding paragraph, there is a diffused scattering in all directions: Why 
should this be? 

A clue to the source of these two types of scattering is provided by 
the following observation. Suppose all the neutrons which enter the 
experiment are prepared in such a-manner that their spin direction 
is up. If the spin direction of the emerging neutrons is analyzed, it 
will be found that there are some up and some down; those which 
still have spin up are scattered only at the discrete Bragg angles, while 
those whose spin has been changed to down come out’ scattered dif- 
fusely in all directions! 

Now in order that a neutron flip its spin from up to down, the law 
of conservation of angular momentum requires that the spin of the 
scattering nucleus be changed from down to up. Therefore, in prin- 
ciple, the particular nucleus which was responsible for scattering that 
particular neutron could be determined. We could, in principle, note 
down before the experiment the spin state of all the scattering nuclei 
in the crystal. Then, after the neutron is scattered, we could rein- 
vestigate the crystal and see which nucleus had changed its spin 
from down to up. If no crystal nucleus underwent such a change in 
spin, then neither did the neutron, and we cannot tell from which 
nucleus the neutron was actually scattered. In this case the alterna- 
tives interfere and the Bragg law of scattering results. 

If, on the other hand, one crystal nucleus is found to have changed 
spin, then we know that this nucleus did the scattering. There are 
no interfering alternatives. The spherical waves of amplitude which 
emerge from this particular nucleus describe the motion of the scat- 
tered neutron, and only the waves emerging from this nucleus enter 
into that description. In this case there is equal likelihood to find 
the scattered neutron coming out in any direction. 

The concept of searching through all the nuclei in a crystal to find 
which one has changed its spin state is surely a needle-in-the-haystack 
type of activity, but nature is not concerned with the practical diffi- 
culties of experimentation. The important fact is that in principle it 
is possible without producing any disturbance of the scattered neu- 
tron to determine (in this latter case where the spin states change) 
which crystal nucleus actually did the scattering. The existence of 
this possibility means that even if we do not actually carry out this 
determination, we are nevertheless dealing with exclusive (and thus 
noninterfering) alternatives. 

On the other hand, the fact that we get interference between alter- 
natives in the situation where the spin states of the neutrons were not 
changed means that it is impossible, even in principle, to ever discover 


18 


The fundamental concepts of quantum mechanics 
19 


which particular crystal nucleus did the scattering—impossible, at 
least, without disturbing the situation during or before the scattering. 


SUMMARY OF PROBABILITY CONCEPTS 


Alternatives and the Uncertainty Principle. The purpose of 
this introductory chapter has been to explain the meaning of a proba- 
bility amplitude and its importance in quantum mechanics and to dis- 
cuss the rules for manipulation of these amplitudes. Thus we have 
stated that there is a quantity called a probability amplitude associated 
with every method whereby an event in nature can take place. For 
example, an electron going from a source at S (Fig. 1-1) to a detector 
at x has one amplitude for completing this course while passing through 
hole 1 of the screen at B and another amplitude in passing through 
hole 2. Further, we can associate an amplitude with the overall event 
by adding together the amplitudes of each alternative method. Thus, 
for example, the overall amplitude for arrival at x is given in Eq. 
(1-2) as 


)1-14( وو + رم = $ 


Next, we interpret the absolute square of the overall amplitude as 
the probability that the event will happen. For example, the proba- 
bility that an electron reaches the detector is 


P = |. + $2)? (1-15) 


If we interrupt the course of the event before its conclusion with 
an observation on the state of the particles involved in the event, we 
disturb the construction of the overall amplitude. Thus if we observe 
the system of particles to be in one particular state, we exclude the 
possibility that it can be in any other state, and the amplitudes asso- 
ciated with the excluded states can no longer be added in as alterna- 
tives in computing the overall amplitude. For example, if we deter- 
mine with the help of some sort of measuring equipment that the 
electron passes through hole 1, the amplitude for arrival at the detector 
is just .رف‎ Further, it does not matter if we actually observe and 
record the outcome of the measurement or not, so long as the meas- 
uring equipment is working. Obviously, we could observe the out- 
come at any time we wished. The operation of the measuring equip- 
ment is sufficient to disturb the system and its probability amplitude. 

This latter fact is the basis of the Heisenberg uncertainty principle, 
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which states that there is a natural limit to the subtlety of any experi- 
ment or the refinement of any measurement. 


The Structure of the Amplitude. The amplitude for an event 
is the sum of the amplitudes for the various alternative ways that the 
event can occur. This permits the amplitude to be analyzed in many 
different ways depending on the different classes into which the alter- 
natives can be divided. The most detailed analysis results from con- 
sidering that a particle going from A to B, for example,.in a given time 
interval, can be considered to have done this by going in a certain 
motion (position vs. time) or path in space and time. We shall there- 
fore associate an amplitude with each possible motion. The total 
amplitude will be the sum of a contribution from each of the paths. 

This idea can be made more clear by a further consideration of our 
experiment with the two holes. Suppose we put a couple of extra 
screens between the source and the hole. Call these screens D and E. 
In each of them we drill a few holes which we number Dı, Ds, . . . 
and Hy, E», ... (Fig. 1-9). For simplicity, we shall assume the 
electrons are constrained to move in the zy plane. Then there are 
several alternative paths which an electron may take in going from 
the source to the hole in screen B. It could go from source to Dz, 
and then ,و‎ and then the hole 1; or it could go from the source to 
D;, then Hy, and finally to the hole 1, ete. Each of these paths has 
its own amplitude. The complete amplitude is the sum of all of them. 

Next, suppose we continue to drill holes in the screens D and 77 
until there is nothing left of the screens. The path of an electron 
must now be specified by the height zp at which the electron passes 
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Fig. 1-9 When several holes are drilled in the screens D and E placed between the source 
in screen A and the final position in screen C, several alternative routes are available for 
each electron. For each of these routes there is an amplitude. The result of any experi- 


ment in which all of the holes are open requires the addition of all of these amplitudes, one 
for each possible path. 
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Fig. 1-10 More and more holes are cut in the screens at yp and yg. Eventually, the 
screens are completely riddled with holes, and the electron has a continuous range of posi- 
tions, up and down along each screen, at which it can pass through the position of the 
screen. In this case the sum of alternatives becomes a double integral over the continuous 
parameters zo and zg describing the alternative heights at which the electron passes the 
position of the screens at yp and yz. 


the position yp at the nonexistent screen D, together with the height 
zg, at the position yz, as in Fig. 1-10. To each pair of heights there 
corresponds an amplitude. The principle of superposition still applies, 
and we must take the sum (or by now, the integral) of these amplitudes 
over all possible values of xp and zz. 

Clearly, the next thing to do is to place more and more screens 
between the source and the hole 1 and in each screen drill so many 
holes that there is nothing left. Throughout this process we continue 
to refine the definition of the path of the electron, until finally we 
arrive at the sensible idea that a path is merely height as a particular 
function of distance, or z(y). We also continue to apply the principle 
of superposition, until we arrive at the integral over all paths of the 
amplitude for each path. 

Now we can make a still finer specification of the motion. Not 
only can we think of the particular path x(y) in space, but we can 
specify the time at which it passes each point in space. That is, a 
path will (in our two-dimensional case) be given if the two functions 
x(t), y(t) are given. Thus we have the idea of an amplitude to take 
a certain path z(t), y(t). The total amplitude to arrive is the sum or 
integral of this amplitude over all possible paths. The problem of 
defining this concept of a sum or integral over all paths in a mathe- 
matically more precise way will be taken up in Chap. 2. 

Chapter 2 also contains the formula for the amplitude for any 
given path. Once this is given, the laws of nonrelativistic quantum 
mechanics are completely stated, and all that remains is a demonstra- 
tion of the application of these laws in a number of interesting special 
cases. 


Quantum mechanics and path integrals 


22 


1-5 SOME REMAINING THOUGHTS 


We shall find that in quantum mechanics, the amplitudes ¢ are solu- 
tions of a completely deterministic equation (the Schrödinger equation). 
Knowledge of ¢ at t = 0 implies its knowledge at all subsequent times. 
The interpretation of |¢|? as the probability of an event is an indeter- 
ministic interpretation. It implies that the result of an experiment 
is not exactly predictable. It is very remarkable that this interpre- 
tation does not lead to any inconsistencies. That it is true has been 
amply demonstrated by analyses of many particular situations by 
Heisenberg, Bohr, Born, von Neumann, and many other physicists. 
In spite of all these analyses the fact that no inconsistency can arise 
is not thoroughly obvious. For this reason quantum mechanics 
appears as a difficult and somewhat mysterious subject to a beginner. 
The mystery gradually decreases as more examples are tried out, but 
one never quite loses the feeling that there is something peculiar about 
the subject. 

There are a few interpretational problems on which work may still 
be done. They are very difficult to state until they are completely 
worked out. One is to show that the probability interpretation of ¢ 
is the only consistent interpretation of this quantity. We and our 
measuring instruments are part of nature and so are, in principle, 
described by an amplitude function satisfying a deterministic equation. 
Why can we only predict the probability that a given experiment will 
lead to a definite result? From what does the uncertainty arise? 
Almost without doubt it arises from the need to amplify the effects 
of single atomic events to such a level that they may be readily 
observed by large systems. The details of this have been analyzed 
only on the assumption that |¢!? is a probability, and the consistency 
of this assumption has been shown. It would be an interesting prob- 
lem to show that no other consistent interpretation can be made. 

Other problems which may be further analyzed are those dealing 
with the theory of knowledge. For example, there seems to be a lack 
of symmetry in time in our knowledge. Our knowledge of the past is 
qualitatively different from that of the future. In what way is only 
the probability of a future event accessible to us, whereas the certainty 
of a past event can often apparently be asserted? These matters 
again have been analyzed to a great extent. Possibly a little more 
can be said to clarify the situation, however. Obviously, we are again 
involved in the consequences of the large size of ourselves and of our 
measuring equipment. The usual separation of observer and observed 
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which is now needed in analyzing measurements in quantum mechanics 
should not really be necessary, or at least should be even more thor- 
oughly analyzed. What seems to be needed is the statistical mechanics 
of amplifying apparatus. 

The analyses of such problems are, of course, in the nature of philo- 
sophical questions. They are not necessary for the further develop- 
ment of physics. We know we have a consistent interpretation of ¢ 
and, almost without doubt, the only consistent one. The problem of 
today seems to be the discovery of the laws governing the behavior 
of ¢ for phenomena involving nuclei and mesons. The interpretation 
of ¢ is interesting. But the much more intriguing question is: What 
new modifications of our thinking will be required to permit us to 
analyze phenomena occurring within nuclear dimensions? 


THE PURPOSE OF THIS BOOK 


So far, we have given the form the quantum-mechanical laws must 
take, i.e., that a probability amplitude exists, and we have outlined 
one possible scheme for calculating this amplitude. There are other 
ways to formulate this. In a more usual approach to quantum 
mechanics the amplitude is calculated by solving a kind of wave 
equation. For particles of low velocity, it is called the Schrédinger 
equation. A more accurate equation valid for electrons of velocity 
arbitrarily close to the velocity of light is the Dirac equation. In this 
case the probability amplitude is a kind of hypercomplex number. 
We shall not discuss the Dirac equation in this book, nor shall we 
investigate the effects of spin. Instead, we limit our attention to low- 
velocity electrons, extending our horizon somewhat in the direction 
of quantum electrodynamics by investigating photons, particles whose 
behavior is determined by Maxwell’s equation. 

In this book we shall give the laws to compute the probability ampli- 
tude for nonrelativistic problems in a manner which is somewhat 
unconventional. In some ways, particularly in developing a concep- 
tual understanding of quantum mechanics, it may be preferred, but 
in others, e.g., in making computations for the simpler problems and 
for understanding the literature, it is disadvantageous. 

The more conventional view, via the Schrödinger equation, is 
already presented in many books, but the views to be presented here 
have appeared only in abbreviated form in papers in the journals.’ 

1R. P. Feynman, The Space-Time Approach to Non-relativistic Quantum 
Mechanics, Rev. Mod. Phys., vol. 20, p. 367, 1948. 
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A central aim of this book is to collect this work into one volume where 
it may be expanded with sufficient clarity and detail to be of use to 
the interested student. 

In order to keep the subject within bounds, we shall not make a 
complete development of quantum mechanics. Instead, whenever a 
topic has reached such a point that further elucidation would best be 
made by conventional arguments appearing in other books, we refer 
to those books. Because of this incompleteness, this book cannot 
serve as a complete textbook of quantum mechanics. ` It can serve as 
an introduction to the ideas of the subject if used in conjunction with 
another book that deals with the Schrödinger equation, matrix 
mechanics, and applications of quantum mechanics. 

On the other hand, we shall use the space saved (by our not devel- 
oping all of quantum mechanics in detail) to consider the application 
of the mathematical methods used in the formulation of quantum 
mechanics to other branches of physics. 

It is a problem of the future to discover the exact manner of comput- 
ing amplitudes for processes involving the apparently more compli- 
cated particles, namely, neutrons, protons,and mesons. Of course, one 
can doubt that, when the unknown laws are discovered, we shall find 
ourselves computing amplitudes at all. However, the situation today 
does not seem analogous to that preceding the discovery of quantum 
mechanics. 

In the 1920’s there were many indications that the fundamental 
theorems and concepts of classical mechanics were wrong, i.e., there 
were many paradoxes. General laws could be proved independently 
of the detailed forces involved. Some of these laws did not hold. 
For example, each spectral line showed a degree of freedom for an 
atom, and at temperature T each degree of freedom should have an 
energy kT, contributing R to the specific heat. Yet this very high 
specific heat expected from the enormous number of spectral lines did 
not appear. 

Today, any general law that we have been able to deduce from the 
principle of superposition of amplitudes, such as the characteristics of 
angular momentum, seems to work. But the detailed interactions 
still elude us. This suggests that amplitudes will exist in a future 
theory, but their method of calculation may be strange to us. 
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The Quantum-mechanical 
Law of Motion 


In this chapter we intend to complete our specification of nonrelati- 
vistic quantum mechanics which we began in Chap. 1. There we 
noted the existence of an amplitude for each trajectory; here we shall 
give the form of the amplitude for each trajectory. For a while, for 
simplicity, we shall restrict ourselves to the case of a particle moving 
in one dimension. ‘Thus the position at any time can be specified by 
a coordinate x, a function of t. By the path, then, we mean a func- 
tion z(t). 

If a particle at an initial time ta starts from the point za and goes 
to a final point 2, at time t, we shall say simply that the particle goes 
from a to b and our function z(t) will have the property that x(t.) = £a 
and x(t) = .ننه‎ In quantum mechanics, then, we shall have an ampli- 
tude, often called a kernel, which we may write K(b,a), to get from the 
point a to the point b. This will be the sum over all of the trajectories 
that go between the end points a and 6 of a contribution from each. 
This is to be contrasted with the situation in classical mechanics in 
which there is only one specific and particular trajectory which goes 
from a to b, the so-called classical trajectory, which we shall label £(é). 
Before we go on to give the rule for the quantum-mechanical case, 
let us remind ourselves of the situation in classical mechanics. 


THE CLASSICAL ACTION 


One of the most elegant ways of expressing the condition that deter- 
mines the particular path Z(t) out of all the possible paths is the prin- 
ciple of least action. That is, there exists a certain quantity S which 
can be computed for each path. The classical path 2 is that for which 
S is a minimum. Actually, the real condition is that S be merely an 
extremum. That is to say, the value of S is unchanged in the first 
order if the path Z(t) is modified slightly. 
The quantity S is given by the expression 


S = f] Lita) dt (2-1) 


where L is the lagrangian for the system. For a particle of mass m 
moving in a potential V (x,t), which is a function of position and time, 
the lagrangian is 


L = Fa - V(x) (2-2) 


The form of the extremum path z(t) is determined with the usual pro- 
cedures of the calculus of variations. Thus, suppose the path is varied 


26 


The quantum-mechanical law of motion 


27 


away from 2 by an amount 6z(t); the condition that the end points 
of @ are fixed requires 


dx(ta) = dx(ts) = 0 (2-3) 
The condition that 2 be an extremum of S means 
ôS = S[# + ôx] — S[z] = 0 (2-4) 


to first order in êz. Using the definition of Eq. (2-1) we may write 


Siz + 82] = fiL + d4,2 + is t) at 


oa to 5 3 ðL ðL 
= ji EZ T ôt z + OL = | dt 


of. OL ðL 


Upon integration by parts, the variation in S becomes 


ðL tb to d ðL ðL 


Since êz is 0 at the end points, the first term on the right-hand side 
of this equation is 0. Between the end points 6z can take on any 
arbitrary value. Thus the extremum is that curve along which the 
following condition is always satisfied: 


d {ðL ðL 
ii (32) Semen 1 


This is, of course, the classical lagrangian equation of motion. 

In classical mechanics, the form of the action integral S = fL dt is 
interesting, not just the extreme value Sa. This interest derives from 
the necessity to know the action along a set of neighboring paths in 
order to determine the path of least action. 

In quantum mechanics both the form of the integral and the value 
of the extremum are important. In the following problems we shall 
evaluate the extremum in a variety of situations. 


Problem 2-1 For a free particle L = mz?/2. Show that the 
action Sa corresponding to the classical motion of a free particle is 


Sa = z (% — Ta)? (2-8) 
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Problem 2-2 For a harmonic oscillator L = (m/2)(4? — wa? . 
With T equal to t — ta show that the classical action is 


mo) 


2 D ee = 2, 2-9 
Sa Tain of (e + °) cos wT - Piate (2-9) 


Problem 2-3 Find Sq for a particle under a constant force F, that 
is, L = mz?/2 — Fr. 


Problem 2-4 Classically, the momentum is defined as 
DT )2-10( 
Show that the momentum at an end point is 
(5) E (2-11) 
OL Jaag, OX 


Hint: Consider the effect on Eq. (2-6) of a change in end points. 


Problem 2-5 Classically, the energy is defined as 
E = L — ip (2-12) 


Show that the energy at an end point is 


Teen (32) yg: (2-13) 


Hint: A change in the time of an end point requires a change in path, 
since all paths must be classical paths. 


THE QUANTUM - MECHANICAL AMPLITUDE 


Now we can give the quantum-mechanical rule. We must say how 
much each trajectory contributes to the total amplitude to go from 
a tob. It is not that just the particular path of extreme action con- 
tributes; rather, it is that all the paths contribute. They contribute 
equal amounts to the total amplitude, but contribute at different 
phases. The phase of the contribution from a given path is the action 
S for that path in units of the quantum of action ñ. That is, to sum- 
marize: The probability P(b,a) to go from a point £a at the time ta to 
the point x, at t, is the absolute square P(b,a) = |K(b,a)|? of an ampli- 
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tude K(b,a) to go from a to b. This amplitude is the sum of contri- 
bution ¢[z(t)] from each path. 


Ka) = } kO] (2-14) 


over all paths 
from a to b 


The contribution of a path has a phase proportional to the action S: 
¢[x(t)] = const e/*)SizO) (2-15) 


The action is that for the corresponding classical system [see Eq. 
(2-1)]. The constant will be chosen to normalize K conveniently, and 
it will be taken up later when we discuss more mathematically just 
what we mean in Eq. (2-14) by a sum over paths. 


THE CLASSICAL LIMIT 


Before we go on to making the mathematics more complete, we shall 
compare this quantum law with the classical rule. At first sight, from 
Eq. (2-15) all paths contribute equally, although their phases vary, 
so it is not clear how, in the classical limit, some particular path 
becomes most important. The classical approximation, however, cor- 
responds to the case that the dimensions, masses, times, etc., are so 
large that Sisenormousin relation to (= 1.05 X 10-?7erg-sec). Then 
the phase of the contribution 5/7 is some very, very large angle. The 
real (or imaginary) part of ¢ is the cosine (or sine) of this angle. This 
is as likely to be plus as minus. Now if we move the path as shown 
in Fig. 2-1 by a small amount 6z, small on the classical scale, the change 
in S is likewise small on the classical scale, but not when measured in 
the tiny unit A. These small changes in path will, generally, make 
enormous changes in phase, and our cosine or sine will oscillate exceed- 
ingly rapidly between plus and minus values. The total contribution 
will then add to zero; for if one path makes a positive contribution, 
another infinitesimally close (on a classical scale) makes an equal 
negative contribution, so that no net contribution arises. 

Therefore, no path really needs to be considered if the neighboring 
path has a different action; for the paths in the neighborhood cancel 
out the contribution. But for the special path 7Z, for which S is an 
extremum, a small change in path produces, in the first order at least, 
no change in S. All the contributions from the paths in this region 
are nearly in phase, at phase Sa, and do not cancel out. Therefore, 
only for paths in the vicinity of can we get important contributions, 
and in the classical limit we need only consider this particular trajec- 
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x(t) 


x(t) 


x 


Fig. 2-1 The classical path 1, Z(t), is that for which a certain integral, the action S, is 
minimum. If the path is varied by éz(t), to path 2, the integral suffers no first-order 
change. This determines the equation of motion. 

In quantum mechanics, the amplitude to go from a to b is the sum of amplitudes-for each 
interfering alternative path. The amplitude for a given path, e‘S/*, has a phase propor- 
tional to the action. 

If the action is very large compared to A, neighboring paths such as 3 and 4 have slightly 
different actions. Such paths will (because of the smallness of f) have very different 
phases. Their contributions will cancel out. Only in the vicinity of the classical path 
Z(t), where the action changes little when the path varies, will neighboring paths, such 8 
1 and 2, contribute in the same phase and constructively interfere. That is why the 
approximation of classical physics—that only the path Z(t) need be considered—is valid 
when the action is very large compared to h. 


tory as being of importance. In this way the classical laws of motion 
arise from the quantum laws. 

We may note that trajectories which differ from 2 contribute as 
long as the action is still within about 7 of Sa. The classical trajec- 
tory is indefinite to this slight extent, and this rule serves as a measure 
of the limitations of the precision of the classically defined trajectory. 

Next consider the dependence of the phase on the position of the 
end point (x,t). If we change the end point a little, this phase changes 
a great deal, and K(b,a) changes very rapidly. If by a “smooth 
function” we mean one like Sa which changes only when changes in 
argument which are appreciable on a classical scale are made, we note 
that K(b,a) is far from smooth, but in this classical approximation 
our arguments show that it is T the form 


K(b,a) = “smooth function” ٠ et/® Sa (2-16) 


All these approximate considerations apply to a situation on a scale 
for which classical physics might be expected to work (S >> i). But 
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at an atomic level, S may be comparable with 7, and then all trajec- 
tories must be added in Eq. (2-14) in detail. No particular trajectory 
is of overwhelming importance, and of course Eq. (2-16) is not neces- 
sarily a good approximation. To deal with such cases, we shall have 
to find out how to carry out such sums as are implied by Eq. (2-14). 


THE SUM OVER PATHS 


Analogy with the Riemann Integral. Although the qualitative 
idea of a sum of a contribution for each of the paths is clear, a more 
precise mathematical definition of such a sum must be given. The 
number of paths is a high order of infinity, and it is not evident what 
measure is to be given to the space of paths. It is our purpose in 
this section to give such a mathematical definition. This definition 
will be found rather cumbersome for actual calculation. In the suc- 
ceeding chapters we shall describe other and more efficient methods 
of computing the sum over all paths. As for this section, it is hoped 
that the mathematical difficulty, or rather inelegance, will not distract 
the reader from the physical content of the ideas. 

We can begin our understanding with a consideration of the ordi- 
nary Riemann integral. We could say, very roughly, that the area 
A, under a curve, is the sum of all its ordinates. Better, we could 
say that it is proportional to that sum. But to make the idea pre- 
cise, we do this: take a subset of all ordinates (e.g., those spaced at 
equal intervals h). Adding these ordinates, we obtain 


A ~} f(a) (2-17) 


where the summation is carried out over the finite set of points 2;, as 
shown in Fig. 2-2. 

The next step is to define A as the limit of this sum as the subset 
of points, and thus the subset of ordinates, becomes more complete 
or—because a finite set is never any measurable part of the infinite 
continuum—we may better say as the subset becomes more represent- 
ative of the complete set. We can pass to the limit in an orderly 
manner by taking continually smaller and smaller values of h. In so 


١ doing, we would obtain a different sum for each value of h. No limit 


exists. In order to obtain a limit to this process, we must specify 
some normalizing factor which should depend on h. Of course, for 
the Riemann integral, this factor is just h itself. Now the limit exists 
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Fig. 2-2 In the definition of the ordinary Riemann integral, a set of ordinates is drawn 
from the abscissa to the curve. The ordinates are spaced a distance h apart. The integral 
(area between the curve and the abscissa) is approximated by h times the sum of the ordi- 


This approximation approaches the correct value as h approaches zero. 


nates. 


An analogous definition can be used for path integrals. The measure which goes to zero 
in the limit process is the time interval e between discrete points on the paths. 


and we may write the expression 


A= lim [% 0 fa) | (2-18) 


Constructing the Sum. We can follow through an analogous 
procedure in defining the sum over all paths. First, we choose a sub- 
set of all paths. To do this, we divide the independent variable time 
into steps of width e. This gives us a set of values t; spaced a distance 
e apart between the values ta and t. At each time t; we select some 
special point .ره‎ We construct a path by connecting all the points so 
selected with straight lines. It is possible to define a sum over all 
paths constructed in this manner by taking a multiple integral over 
all values of x; for i between 1 and N — 1, where 


Ne = th — ta 
e= tai = & 

2-19 
to ج‎ 1 tw = to ( ) 
to = Ta tn = Do 

The resulting equation is 
KOSE ل‎ olz(t)] dzı dzz ٠ ° ° dzx-ı (2-20) 


We do not integrate over zo or zy because these are the fixed end 
points za and x. This equation corresponds formally to Eq. (2-17). 
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In the present case we can obtain a more representative sample of the 
complete set of all possible paths between a and b by making e smaller. 
However, just as in the case of the Riemann integral, we cannot pro- 
ceed to the limit of this process because the limit does not exist. Once 
again we must provide some normalizing factor which we expect will 
depend upon e. 

Unfortunately, to define such a normalizing factor seems to be a 
very difficult problem and we do not know how to do it in general 
terms. But we do know how to give the definition for all situations 
which so far seem to have practical value. For example, take the 
case where the lagrangian is given by Eq. (2-2). The normalizing 
factor turns out to be A-Y, where 


ee ونم‎ (2-21) 


We shall see later (e.g., Sec. 4-1) how this result is obtained. With 
this factor the limit exists and we may write 


5 j | 5 dz dz» dIN-1 
= uh FA: GIA) S [b,a] re j 
BEDS) lim 7 Jf fe A A A a 
where 
to 
Sib,a] = |, LG@,2,t) at (2-23) 


is a line integral taken over the trajectory passing through the points 
zx; with straight sections in between, as in Fig. 2-3. 


Fig. 2-3 The sum over 
paths is defined as a limit, 
in which at first the path is 
specified by giving only its 
coordinate ند‎ at a large num- 
ber of specified times sepa- 
rated by very small intervals 
e. The path sum is then an 
integral over all these spe- 
cific coordinates. Then to 
achieve the correct measure, 
the limit is taken as e 
approaches 0. 
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It is possible to define the path in a somewhat more elegant manner. 
Instead of straight lines between the points 2 and 2 + 1, we could use 
sections of the classical orbit. Then we could say that S is the mini- 
mum value of the integral of the lagrangian over all the paths which 
go through the specified points (a;,t;). With this definition no recourse 
is made to arbitrary straight lines. 


The Path Integral. There are many ways to define a subset of all 
the paths between a and b. The particular definition we have used 
here may not be the best for some mathematical purposes. For exam- 
ple, suppose the lagrangian depends upon the acceleration of x. In 
the way we have constructed the path, the velocity is discontinuous 
at the various points (2;,t;); that is, the acceleration is infinite at these 
points. It is possible that this situation would lead to trouble. How- 
ever, in the few such examples with which we have had experience the 
substitution 


1 : 
)2-24( )2 دك (a‏ لال 


has been adequate. There may be other cases where no such substi- 
tution is available or adequate, and the present definition of a sum 
over all paths is just too awkward to use. Such a situation arises in 
ordinary integration in which the Riemann definition, as in Eq. (2-18), 
is not adequate and recourse must be had to some other definition, 
such as that of the Lebesgue. 

The necessity to redefine the method of integration does not destroy 
the concept of integration. So we feel that the possible awkwardness 
of the special definition of the sum over all paths [as given in Eq. 
(2-22)] may eventually require new definitions to be formulated. 
Nevertheless, the concept of the sum over all paths, like the concept 
of an ordinary integral, is independent of a special definition and valid 
in spite of the failure of such definitions. Thus we shall write the sum 
over all paths in a less restrictive notation as 


b 
K(b,a) = f’ ems )ده‎ (2-25) 
which we shall call a path integral. The identifying notation in this 


expression is the script D. Only rarely shall we return to the form 
given in Eq. (2-22). 


Problem 2-6 The class of functionals for which path integrals can 
be defined is surprisingly varied. So far we have considered func- 
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Fig. 2-4 The path of a relativistic particle 
traveling in two dimensions is a zigzag of 
straight segments. The slope of the seg- 
ments is constant in magnitude and differs 
only in sign from zig to zag. The amplitude 
for a particular path, as well as the kernel to 
go from a to b, depends on the number of 
corners 2 along a path, as shown by Eqs. 
(2-26) and (2-27). 


tionals such as that given in Eq. (2-15). Here we shall consider quite 
a different type. This latter type of functional arises in a one-dimen- 
sional relativistic problem. Suppose a particle moving in one dimen- 
sion can go only forward or backward at the velocity of light. For 
convenience, we shall define the units such that the velocity of light, 
the mass of the particle, and Planck’s constant are all unity. Then 
in the zt plane all trajectories shuttle back and forth with slopes of 
+45°, as in Fig. 2-4. The amplitude for such a path can be defined 
as follows: Suppose time is divided into small equal steps of length e. 
Suppose reversals of path direction can occur only at the boundaries 
of these steps, i.e., at t = ta + ne, where n is an integer. For this 
relativistic problem the amplitude to go along such a path is different 
from the amplitude defined in Eq. (2-15). The correct definition for 
the present case is 


@ = (te)® (2-26) 


where R is the number of reversals, or corners, along the path. 

As a problem, the reader may use this definition to calculate the 
kernel K(b,a) by adding together the contribution for the paths of one 
corner, two corners, etc. Thus determine 


K(b,a) = X N(R) (ie)? (2-27) 
R 


where N(R) is the number of paths possible with R corners. It is 
best to calculate four separate K’s, namely, the amplitude K,,(b,a) 
of starting at the point a with a positive velocity and coming into the 
point b with a positive velocity, the amplitude K,_(b,a) of starting 
at the point a with a negative velocity and coming into the point b 
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with a positive velocity, and the amplitudes K_, and K_~_ defined in 
a similar fashion. 

Next suppose the unit of time is defined as 7/762. If the time inter- 
val is very long (ts — ta >> h/mc?) and the average velocity is small 
[£e — ta >> c(t — ta)], show that the resulting kernel is approximately 
the same as that for a free particle [given in Eq. (3-3)], except for a 
factor exp [(imc?/h) (ta — t%)]. The definition given here for the ampli- 
tude, and the resulting kernel, is correct for a relativistic theory of a 
free particle moving in one dimension. The result is equivalent to 
the Dirac equation for that case. 


EVENTS OCCURRING IN SUCCESSION 


The Rule for Two Events. In this section we shall derive an 
important law for the composition of amplitudes for events which 
occur successively in time. Suppose t. is some time between ta and t. 
Then the action along any path between a and b can be written as 


S[b,a] = S[b,c] + S[c,a] (2-28) 


This follows from the definition of the action as an integral in time 
and also from the fact that L does not depend on derivatives higher 
than the velocity. (Otherwise, we would have to specify values of 
velocity and perhaps higher derivatives at point c.) Using Eq. (2-25) 
to define the kernel, we can write 


K(b,a) = fe/MSl.l+G/%Sle,a] Dy (t) (2-29) 


It is possible to split any path into two parts. The first part would 
have the end points za and z. = x(t), and the second part would have 
the end points x, and zs, as shown in Fig. 2-5. It is possible to inte- 
grate over all paths from a to c, then over all paths from c to b, and 
finally integrate the result over all possible values of ze. In perform- 
ing the first step of this integration S[b,c] is constant. Thus the result 
can be written as 


K(b,a) = j, f} esmsax(c,a) Da(t) dze (2-30) 


where integrations must now be carried out not only over paths 
between c and b but also over the variable end point že. In the next 
step we carry out the integration over all paths between some point 
with an arbitrary xe and the point b. All that is left is an integral 
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Fig. 2-5 One way the sum over all 
paths can be taken is by first summing 
over paths which go through the 
point at xe and time t and later 
summing over the points ze The 
amplitude on each path that goes 
from a to b via c is a product of two 
factors: (1) an amplitude to go from 
a to c and (2) an amplitude to go from 
c to b. This is therefore valid also 
for the sum over all paths through c: 
the total amplitude to go from a to b 
via cis K(b,c)K(c,a). Thus summing 
over the alternatives (values of ze), 
we get for the total amplitude to go 
from a to b, Eq. (2-31). 


over all possible values of xe.. Thus 
K(b,a) = | K(b,0)K(c,a) dz. (2-31) 


Perhaps the argument is clearer starting from Eq. (2-22). Select 
one of the discrete times as t. Thus let t = t and xe = .يرنه‎ First 
carry out all the integrations over z;, such that 2 > k. This will intro- 
duce the factor in the integral K(c,a). Next carry out the integrals 
over all x;, such that i > k. This introduces the factor K(b,c). All 
that is left is an integral over z.. Theresult can be written as Eq. (2-31). 

This result can be summarized in the following way. All alterna- 
tive paths from a to b can be labeled by specifying the position ونه‎ 
through which they pass at the time te. Then the kernel for a particle 
going from a to b can be computed from the rules: 


1. The kernel to go from a to b is the sum, over all possible values 
of x, of amplitudes, for the particle to go from a to © and then to b. 

2. The amplitude to go from a to c and then to b is the kernel to 
go from a to c times the kernel to go from c to b. 


Thus we have the rule: Amplitudes for events occurring in succession 
in time multiply. 


Extension to Several Events. There are many applications for 
this important rule, and several will be developed in succeeding 
chapters. Here we shall show the application wherein we follow an 
alternative route in deriving the equation for the kernel, Eq. (2-22). 
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It is perfectly possible to make two divisions in all the paths: one 
at te and the other at, say, ta. Then the kernel for a particle going 
from a to b can be written as 


Ka) = j, [KOK (MK (Ga) dze dza (2-32) 


This means that we look at a particle which goes from a to 6 as if it 
went first from a to d, then from d to c, and finally from c to b. The 
amplitude to follow such a path is the product of the kernels for each 
part of the path. The kernel taken over all such paths that go from 
a to b is obtained by integrating this product over all possible values 
of x, and Ta. 

We can continue this process until we have the time scale divided 
into N intervals. The result is 


Kiba) = f f+: ibe KOON TON Nee ee 
K(i + 1,i) - ° ° K(,a) drıdzə ٠ ٠ + dry_1 (2-88) 


This means that we can define the kernel in a manner different from 
that given in Eq. (2-22). In this alternative definition the kernel for 
a particle to go between two points separated by an infinitesimal time 
interval e is 


: ل كفو‎ 26 Bier = te Dea a 1 FG 
K@+ 1,2) = g P E L (G24 eee Wa ee (2-34) 
which is correct to first order in e. Then by the rules for multiplying 
amplitude of events which occur successively in time, we have 


N-1 


4[x(t)] = lim I] K@+1,% (2-35) 
© i=0 


for the amplitude of a complete path. Then, using the rule that ampli- 
tudes for alternative paths add, we arrive at a definition for K (b,a). 
It can be seen that the resulting expression is actually the same as 
Eq. (2-22). 


SOME REMARKS 


In the relativistic theory of the electron we shall not find it possible 
to express the amplitude for a path as e*S/*, or in any other simple 
way. However, the laws for combining amplitudes still work (with 
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some small modifications). The amplitude for a trajectory still exists. 
As a matter of fact, it is still given by Eq. (2-35). The only differ- 
ence is that K(z + 1, 2( is not so easily expressed in a relativistic theory 
as it is in Eq. (2-34). The complications arise from the necessity to 
consider spin and the possibility of the production of pairs of electrons 
and positrons. 

In nonrelativistic systems with a larger number of variables, and 
even in the quantum theory of electromagnetic field, not only do the 
laws for combining amplitudes still hold but the amplitude itself fol- 
lows the rules set down in this chapter. That is, each motion of a 
variable has an amplitude whose phase is 1/7 times the action associ- 
ated with the motion. 

We shall take up these more complicated examples in later chapters. 
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Developing the Concepts 
with Special Examples 


In this chapter we shall develop the kernels governing some special 
types of motion. We shall explore the physical meaning of the mathe- 
matical results in order to develop some physical intuition about 
motion under quantum-mechanical laws. The wave function will be 
introduced and its relation to the kernel will be described. This rep- 
resents the first step in connecting the present approach to quantum 
mechanics with the more traditional approaches. 

We shall also introduce some special mathematical methods for com- 
puting the sum over all paths. The idea of a sum over all paths was 
described in Chap. 2 with the help of a particular computational 
method. Although that method may clarify the concept, it is an awk- 
ward tool with which to work. The simpler methods to be introduced 
in this chapter will be of great use in our future work. 

Thus the present chapter has three purposes: deepening our under- 
standing of quantum-mechanical principles, beginning the connection 
between our present approach and other approaches, and introducing 
some useful mathematical methods. 


THE FREE PARTICLE 


The Path Integral. The method used in Chap. 2 to describe a 
sum over all paths will be used here to compute the kernel for a free 
particle. The lagrangian for a free particle is 


£2 
Thus with the help of Eqs. (2-21) to (2-23) the kernel for a free par- 
ticle is 
im Š 
K(ba) = tim ff +++ f exp [ae إل‎ © - 2-0] 


7 —N/2 
dt, ` ٠ ° dzy (2) (3-2) 


This represents a set of gaussian integrals, i.e., integrals of the form 
Jlexp (—az?)] dz or f[exp (—~az? + bx)] dz. Since the integral of a 
gaussian is again a gaussian, we may carry out the integrations on one 
variable after the other. After the integrations are completed, the 
limit may be taken. The result is 


2rth (te — ta) |2 ; = 2,)? 
Ka) = | 9 | exp ye (3-3) 
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The calculation is carried out as follows. Notice first that 


8 م‎ exp {ae a اس المت‎ 0| a 


Next we multiply this result by 


(zz) exp E (£3 — 2| (3-5) 


and integrate again, this time over .ون‎ The result is similar to that 
of Eq. (3-4), except that (x2 — xo)? becomes (x3 — xo)? and the expres- 
sion 2e is replaced by 3e in two places. Thus we get 


2717 ٠ 3e\—% m (Geo? 
Sree) lage E 


In this way a recursion process is established which, after n — 1 steps, 
gives 


Qrihne\—% m (Gk ceed) 
m REA و‎ 


Since ne = tn — to, it is easy to see that the result after N — 1 steps 
is identical with Eq. (3-3). 

There is an alternative procedure. Equation (3-4) can be used to 
integrate over all the variables x; for which 7 is odd (assuming N is 
even). The result is an expression formally like Eq. (8-2), but with 
half as many variables of integration. The remaining variables are 
defined at points in time spaced a distance 2e apart. Hence, at least 
in the case that N is of the form 2*, Eq. (3-3) results from k steps of 
this kind. 


Problem 3-1 The probability that a particle arrives at the point 
b is by definition proportional to the absolute square of the kernel 
K(b,a). For the free-particle kernel of Eq. (3-3) this is 


m 

Ce ees 3-6 

P(b) dx Wi TT) dx (3-6) 
Clearly this is a relative probability, since the integral over the com- 
plete range of x diverges. What does this particular normalization 
mean? Show that this corresponds to a classical picture in which a 
particle starts from the point a with all momenta equally likely. Show 
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Fig. 3-1 The real part of amplitude to arrive at various distances z from the origin after a 


The imaginary part (not shown) is an analogous wave 90° out of phase, so that the 


time t. 


absolute square of the amplitude is constant. The wavelength is short at large z, namely, 
where a classical particle could arrive only if it moved with high velocity. Generally, the 
wavelength and classical momentum are inversely related [Eq. (3-10)]. 


that the corresponding relative probability that the momentum of the 
particle lies in the range dp is dp/2rħ. 


Momentum and Energy. We now study some of the implications 
of the free-particle kernel. For convenience let the point a represent 
the origin in both time and space. The amplitude to go to some other 
point b = (x,t) is 


TA ole 
K(2,t;0,0) = C=) ima 


(3-7) 


If time is fixed, the amplitude varies with distance as shown in Fig. 
3-1, in which the real part of Eq. (3-7) is plotted. 

We see that as we get farther from the origin the oscillations become 
more and more rapid. If v is so large that many oscillations have 
occurred, then the distance between successive nodes is nearly con- 
stant, at least for the next few oscillations. That is, the amplitude 
behaves much like a sine wave of slowly varying wavelength. It is 
interesting to evaluate this wavelength à. Changing x by à must 
increase the phase of the amplitude by 2r. That is, 


_ ma +a)? _ mx? _ mad mN 


3 A EOE ل‎ CORE م‎ 
Neglecting the quantity à? relative to zd (that is, assuming z > A), 
we find 
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From a classical point of view a particle which moves from the origin 
to x in the time interval ¢ has a velocity z/t and a momentum mx/t. 
From the quantum-mechanical point of view, when the motion can 
be adequately described by assigning a classical momentum to the 
particle of p = mz/t, then the amplitude varies in space with the 
wavelength . 


h 

A > (3-10) 

We may show this relation still more generally. Suppose we have 
some large piece of apparatus, such as a magnetic analyzer, which is 
supposed to bring particles of a given momentum p to a given point. 
We shall show that, whenever the apparatus is large enough that clas- 
sical physics offers a good approximation, then the amplitude for a 
particle to arrive at the prescribed point varies rapidly in space with 
a wavelength equal to h/p. For as we have seen, in such a situation, 
the kernel is approximated by 


ages E ,مامه‎ | (3-11) 


Changes in the position of the final point x, cause variations in the 
classical action. If this action is large compared to 7 (the semiclassical 
approximation), the kernel will oscillate very rapidly with changes 
in .نت‎ The change in phase per unit displacement of the end point is 


k= (3-12) 


سم | 
به 
3 


but نندة /روكرة‎ is the classical momentum of the particle when it arrives 
at the point x» (see Prob. 2-4). Thus p = fk. This quantity k, the 
phase change per unit distance of a wave, is called the wave number, 
and it is very convenient to use. Since the wavelength is the distance 
over which the phase changes by 27, then k = 27/A. Equation (3-12) 
is de Broglie’s formula relating the momentum to the wave number of 
a wave, p = hk. 

Next, let us study the time dependence of the free-particle kernel 
given by Eq. (3-7). Suppose we hold the distance fixed and vary the 
time. The variation of the real part of the kernel is shown in Fig. 3-2. 
Both frequency and amplitude change with t. 

Suppose ¢ is very large and neglect the change in amplitude with 
variations of t The period of oscillation T is defined as the time 
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Fig. 3-2 The amplitude 
to find the particle at a 
given point varies with 
time. The real part is 
plotted here. The fre- 
quency of the oscillations 

t is proportional to the en- 
ergy that a classical par- 
ticle would have to have 
to arrive at the point in 
question within the time 
interval t. 


required to increase the phase by 2r. Thus 


max? ma? l 7 ) 


20 = Ohi ThE TT) EIT 


By introducing the angular frequency w = 2r/T, and assuming t > T, 
we can write this equation as 


(3-13) 


m {x\? 
Since m(a/t)?/2 is the classical energy of a free particle, this equation 
says 
Energy = hw (3-15) 


This relation, like the one relating momentum and wavelength, 
holds for any apparatus which can be adequately described by classical 
physics; and, like the previous relation, it can be obtained from a more 
general argument. 

Referring to Eq. (3-11), any variation of the time t of an end point 
will cause a rapid oscillation of the kernel. The resulting frequency is 


1 3S, 
w= 5 ee (3-16) 


The quantity 0S./dt is interpreted classically as the energy E (refer 
to Prob. 2-5). Thus 


E 
ارين‎ (3-17) 
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In this way the concepts of momentum and energy are extended to 
quantum mechanics by the following rules: 


1. If the amplitude varies as e7, we say that the particle has 
i heen hk. 
. If the amplitude has a definite frequency varying in time as et, 
we say the energy is ñw. 


We have just shown that this rule will agree with the definition of 
energy and momentum in the classical limit. 


Problem 3-2 Show by substitution that the free-particle kernel 
K(b,a) satisfies the differential equation 
h dK(b,a) h? aK (b,a) 


tort CAE)‏ ا 


whenever t is greater than ta. 


DIFFRACTION THROUGH A SLIT 


The Conceptual Experiment. We can learn more about the 
physical interpretation of quantum mechanics and its relation to clas- 
sical mechanics by considering another somewhat more complicated 
example. Suppose a particle is liberated at t = 0 from the origin and 
then, at an interval of time T later, we observe that it is at a certain 
point xo. Classically, we would say that the particle has had a velocity 
vo = Xo/T. The implication would be that if a particle were left alone 
to continue for another interval of time 7, it would move an additional 
distance vor. To analyze this quantum-mechanistically, we shall 
attempt to solve the following problem: 

Att = 0 the particle starts from the origin z = 0. After an interval 
T we shall suppose that it is known that the particle is within the dis- 
tance +b of zo. We ask: After an additional interval r, what is the 
probability of finding the particle at an additional displacement x from 
the position xo? The net amplitude to arrive at the position z at the 
time t + 7 can be considered as the sum of contributions from every 
trajectory that goes from the origin to the final point, provided that 
that trajectory lies in the interval +b from zo at the time T. 

We shall calculate this in a moment, but first it is worth remarking 
on what kind of an experiment we are contemplating here. How can 
we know that the particle passes the point zo within the interval +b? 


-One way would be to make an observation of the particle at the time 
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Fig. 3-3 A particle starting at x = 0 when t = 0 is determined to pass between zo — b 
and zo + b at t = T. We wish to calculate the probability of finding the particle at some 
point z at a time r sec later, i.e., when t = T + r. According to classical laws, the particle 
would have to be between zo(r/T) + 5)1 +7/T) and zo(r/T) — b(1 + 7/T), that is, 
between the rectilinear extensions of the original slit. However, quantum-mechanical 
laws show that such particles have a nonzero probability of appearing outside these 
classical limits. 

We cannot approach this problem by a single application of the free-particle law of 
motion, since the particle is actually constrained by the slit. So we break the problem up 
into two successive free-particle motions. The first takes the particle from x = 0 att = 0 
to x = zo + y at t = T, where |y| <b. The second takes the particle from zo + y at 
t= Ttoxatt = T +r. The overall amplitude is an integral of the product of these two 
free-particle kernels, as shown by Eq. (3-19). 


T to see if it is within the interval +b. This would be the most natu- 
ral way to proceed, but it is somewhat more difficult to analyze 
in detail (because of the complicated interaction between the elec- 
tron and the observing mechanism) than another way of doing the 
experiment. 

Suppose we look, say, with very strong light, everywhere all along 
the x axis except within +b from the point xo at the time T. If we 
find the particle, we discontinue the experiment. We consider only 
those cases in which a thorough investigation of the region, except for 
the region +b, shows no particle is present. That is, all trajectories 
which pass outside the limits +b from ونه‎ are rejected. The experi- 
mental situation is diagramed in Fig. 3-3. The amplitude then can 
be written as 


Wa) = J, K@ + zo T + ونه زج‎ + y, T)K (eo +y, T;0,0) dy (3-19) 


This expression is written down in accordance with the rule for com- 
bining amplitudes of events occurring in succession in time. The first 
event is that the particle goes from the origin to the slit. The second 
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event is that the particle proceeds from the slit to the point 2: further on. 
The slit has a finite width, and passage through each elemental interval 
of the slit represents an alternative way of proceeding along the com- 
plete path. Thus we must integrate over the width of the slit. All 
particles which miss the slit are captured and removed from the 
experiment. Amplitudes for such particles are not included. All the 
particles which get through the slit move as free particles with kernels 
given by Eq. (3-3). Thus the amplitude is 


re 1 8 رست‎ {exp E3 2) 


EEN [en | "جيم‎ w eao 


This integral can be expressed in terms of Fresnel integrals. Such 
a representation contains the physical results we are after, but in an 
obscure way owing to the mathematical complexity of the Fresnel 
integral form. Rather than confuse the physical results by mathe- 
matical complexity, we shall set up a different, but analogous, expres- 
sion which leads to a simpler mathematical form. 


The Gaussian Slit. Suppose we introduce a function G(y) as a 
factor in the integrand. If this function is defined as unity between 
y = —b and y = +b and zero elsewhere, the limits of integration can 
be extended to infinity without any change in the result. Then 


where 


Gy) = 1 for —b > ن‎ <b 
=0 for |y| > 6 


Instead of this, suppose we define © (y) to be a gaussian function, thus: 
G(y) = ewl (3-22) 


This function has the shape shown in Fig. 3-4. The effective width of 
such a curve is related to the parameter b. For this particular func- 
tion, approximately two-thirds of the area under the curve lies between 
—b and +b. 

We do not know how to design metal parts for our imaginary experi- 
ment which will produce such a gaussian slit. However, there is no 
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Fig. 3-4 The form of the 
gaussian function G(y) = 
e~v?/2b”, The curve has the 
same shape as the normal 
distribution with a stand- 
ard deviation b. 


4 


A 


conceptual diffculty. We now have a situation in which the particles 
at time T are distributed along the z axis with a relative amplitude 
proportional to the function G(y). (The relative probability is pro- 
portional to [G(y)]?.) If the particles move classically, we would 
expect, after a succeeding interval of time r, to find them similarly 
distributed along the x axis with a new center a distance zı beyond 2o 
and an increased width parameter bı given by 


To T 
vi = @r bi = b € a 7) (3-23) 


as shown in Fig. 3-5. 
With such a gaussian slit the equation for the amplitude is 


am fx? TO 
E ١ ek E E 3 7 
imf _ 22 | Xo im wm 1 
zj ~ a z) y 2 + AT 35 || eR 
This integral is of the form 
o0 2 
[exp (ax? + Bxr)] dx = , pe exp — E for Re (a) <0 (3-25) 


which is integrated by completing the square in the exponent. Thus 
the amplitude becomes 


(e [it 8+9‏ + 1 + )| وک = هام 


(im/h)?(—a/7 + zo/T) 
“am BO E En (cae) 
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The classical velocity to get from the origin to the center of the slit 
is vo = %/T. When we use this as a substitution and rearrange some 
of the terms, the expression for the amplitude becomes 


NT 
06 = stg (7 ++ Te) 


im T? (m?/2h?r?) (x — vor) 
exp É (vor TE =) + CONO NE ey Sak Cee ye | (3-27) 

We shall consider first the relative probability for the particle to 
arrive at various points along the x axis. This probability is propor- 
tional to the absolute square of the amplitude. The absolute value 
of an exponent with an imaginary argument is 1. So, by rationalizing 
the second factor and the denominator in the last exponent of Eq. 
(3-27), we obtain 


O (LZ vor)?‏ ا 
P(x) dx = FAT Ax FP Sa" | dx (3-28)‏ 


where we have used the substitution 
م زه‎ E 4 لش‎ 
(Az) b (1 ل‎ 7) + mb? 


rh? 


= 'b,? + ae (3-29) 


Fig. 3-5 The paths of particles moving through a gaussian slit. If the particles obeyed 
classical laws of motion, then the distribution of particles at time T + ع‎ would have the 
same form as the distribution at time T. ‘The difference would be only a spreading out 
proportional to the time of flight. The characteristic width of the distribution would be 
increased from 2b to 2bı, where bı = b(T +7)/T. For quantum-mechanical motion, the 
actual spreading is greater than this. 
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As we expected, the distribution is a gaussian centered about the 
point xı = vor of Eq. (3-23). However, the spread of the distribu- 
tion, Az, is larger than the expected value bı of Eq. (3-23). This can 
be interpreted in the following manner. Suppose a; and az are two 
independent random variables whose root-mean-square deviations 
about their average values are.respectively a; and az. Then if 
a3 = aı + روه‎ the rms deviation of a; about its average value is 
as = (aı + a?)*. Now, the rms deviation in a particular distri- 
bution is a measure of the spread, or width, of the distribution. As 
a matter of fact, for the gaussian distribution e772? the rms value is b. 

Thus in the present case we find that the quantum-mechanical sys- 
tem acts as if it had an extra random variable zı whose rms deviation is 


hr 
Ax, = mp 
It is this extra deviation Az, or spreading, rather than the apparent 
extra variable zı, which has physical significance. That this term is 
quantum-mechanical in nature is clear from the inclusion of the con- 
stant %. Such a term is important for particles of small mass and for 
narrow slits. 

Thus quantum mechanics tells us that for small particles, passage 
through a narrow slit makes the future position uncertain. This 
uncertainty Az, is proportional to the time interval r between passage 
through the slit and the next observation of position. If we introduce 
the classical notion of velocity, we can say that passage through a slit 
causes a velocity uncertainty whose size is 


7 
ôv = Ê (3-31) 


(3-30) 


We could take the width parameter 2b of the slit as a measure of the 
uncertainty of position of the particle at the time it passed through 
the slit. If we call this uncertainty ôx and write the product mv as 
the momentum p, then Eq. (3-31) becomes 


dp ôx = 2h (3-32) 


Once more we have arrived at a statement of the uncertainty 
principle. It tells us that, although classically the velocity might be 
known, the future position has an additional uncertainty as though a 
random momentum had been generated by passing through a slit of 
width ôr. If classical concepts are used to describe the results of 
quantum mechanics qualitatively, then we would say that knowledge 
of position creates uncertainty in momentum. 
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What about the factors that appear in front of the exponent in 
Eq. (3-28)? If we integrate this expression over the complete range 
of x from — o to + œ, the result is 


P(any x) = TIT b Vr (3-33) 


This must ‘be the probability that the particle gets through the slit, 
since the integration includes those particles and only those particles 
which did get through. But we have another way of obtaining this 
result. Suppose we take the absolute square of the kernel K (zo +y, 
T; 0, 0), which comprises the second half of the integrand in Eq. (3-20). 
This is just the probability per unit distance that the particle arrives 
at the point zo + y in the slit. That is, 


P(zo + y) dy = dy (3-34) 


m 
2711 

This probability is independent of the position along the slit. Thus, 
if we were to multiply it by the width of this slit, we would obtain the 
total probability for the particle to arrive at the slit. This implies 
that the effective width of our gaussian slit is V7 5. Had we used 
the original sharp-edged slit, we would find the effective width to be 2b. 


Problem 3-3 By squaring the amplitude given in Eq. (3-20) and 
then integrating over x, show that the probability of passage through 
the original slit is 


P(going through) = ZAT 2b (3-35) 


In the course of this problem the integral 
| e de (3-36) 


willappear. This is the integral representation of the Dirac 6 function 
of a.f 

Thus the quantum-mechanical results agree with the idea that the 
probability that a particle goes through a slit is equal to the proba- 
bility that the particle arrives at the slit. 


Momentum and Energy. Next we shall verify again that, when 
the momentum is definite, the amplitude varies as e**. We return to 


+See the table of integrals in the Appendix and L. I. Schiff, “Quantum 
Mechanics,” 2d ed., pp. 50-52, McGraw-Hill Book Company, New York, 1955. 
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a detailed study of the amplitude given in Eq. (8-26). This time we 
shall try to arrange conditions in our experiment so that the particle 
velocity after passing through the slit is known as accurately as possible. 
Quite apart from any quantum-mechanical considerations, there is 
a classical uncertainty of b/T in the velocity. For any given slit 
width we can make this uncertainty negligible by choosing T very 
large. We can also make zo extremely large so that the average 
velocity zo/T = vo does not go to 0. Considering vo and r as con- 
stants, the expression for the amplitude in the limit of large T is 


yz) = 


const a imz? m?(x — 2م‎ (3-37) 
V1 ل‎ 2 P | fr 4h?r?(am/2hr — 1/2b?) 


Next we must arrange that the quantum-mechanical uncertainty in 
momentum 7/6 is very small. That is, we take b very large, so we can 
neglect 1/b?. Then the amplitude can be written as 


2010 amv? 


y(x) const exp ( EKODI 7 (3-38) 


This is an important result. It says that, if we have arranged things 
so that the momentum of a particle is known to be p, then the ampli- 
tude for the particle to arrive at the point x at the time t is 


y(x) ~ const exp (j px — One ) (3-39) 
h h 2m 

We notice that this is a wave of definite wave number k = p/f. Fur- 

thermore, it has a definite frequency w = p?/2mh. This means we 

can say that a free particle of momentum p has a definite quantum- 

mechanical energy (defined as 7 times frequency) which is 22/272 just 

as in classical mechanics. 

The probability of arriving at any particular x, which is propor- 
tional to the square of the amplitude, is in this case independent of x. 
Thus exact knowledge of velocity means no knowledge of position. 
In arranging the experiment to give an accurately known velocity we 
have lost our chances for an accurate prediction of position. We have 
already seen that the reverse is true. The existence of the quantum- 
mechanical spread, inversely proportional to the slit with 2b, implies 
that an exact knowledge of position precludes any knowledge of 
velocity. So, if you know where it is, you cannot say how fast it is 
going; and, if you know how fast it is going, you cannot say where it is. 
This again illustrates the uncertainty principle. 
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3-3 RESULTS FOR A SHARP-EDGED SLIT 


Leaving the limiting case, suppose we return to a situation in which 
the slit width and quantum-mechanical spread are comparable in size 
and the times and distances of travel are not extremely large. We 
have seen that a gaussian slit leads to a gaussian distribution. If we 
use the more realistic sharp-edged version and work out the resulting 
Fresnel integrals, the probability distribution at the time r after passing 
through the slit is like that shown in Fig. 3-6. 


08 
oe 
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Fig. 3-6 The three curves show the distribution of electrons which have passed through 
sharp-edged slits of various widths. In each case the position of the classically predicted 
width bı, = b(1 + 7/T) is shown by the dashed vertical line. Three different values have 
been chosen for the ratio of the classical width bı to the quantum-mechanical spreading 
Azı: curve (a), bı/Axzı = 15; curve (b), bı/Axzı = 1; and curve (c), bı/Axzı = 145. In each 
case, the distribution spreads beyond the classical width. The rms width of the distribu- 
tion is roughly proportional to Ar = [(Az,)? + (b,)2]”2, 
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This distribution is expressed by 


P(a) de = مستبي‎ 2 (Clu) - Cu? 

+ 3 [S(u1) — S(us)]* de (340) 
where 
a VEL UD tig x — وود‎ + bl + 2/T) (3-41) 


(xhir/m)(1 + 7/T) (xhir/m)(1 + T/T) 


and C(u) and S(u) are the real and imaginary parts of the Fresnel 
integral. The first factor in this probability distribution is identical 
to the probability distribution of a free particle given in Eq. (2-6). 
The remaining factor contains a combination of real and imaginary 
Fresnel integrals." It is this factor which is responsible for the vari- 
ations shown in the curves of Fig. 3-6. 

Thus for both slits the general result is the same. The most prob- 
able place to find the particle is within the classical projection of the 
slit. Beyond this there is the quantum-mechanical spreading. 

We have treated this problem as if it were a combination of two 
separate motions. First the particle goes to the slit, and then it goes 
from the slit to the point of observation. The motion seems almost 
disjointed at the slit. It might be asked then, how does a particle 
with such a disjointed motion “remember” its velocity and head in 
the general direction predicted by classical physics? Or, to put it 
another way, how does making the slit narrower cause a “loss of 
memory” until, in the limit, all velocities are equally likely for the 
particle? 

To understand this, let us investigate the amplitude to arrive at the 
slit. This is just the free-particle amplitude given by Eq. (3-3), with 
Ta = ta = 0, ونه‎ = £o + y, and t = T. As we move across the slit (vary 
y), both real and imaginary parts of the amplitude vary sinusoidally. 
As we have seen, the wavelength of this variation is connected to the 
momentum [refer to Eq. (3-10)]. The subsequent motion is a result 
of optical-like interference among these waves. The interference is 
constructive in the general direction predicted by classical physics and, 
in general, destructive in other directions. 

If there are many wavelengths across the slit (i.e., the slit is very 
wide) the resulting interference pattern is quite sharp and the motion 
is approximately classical. But suppose the slit is made so narrow 


1 Refer to .م‎ 34 of E. Jahnke and F. Emde, “Tables of Functions,” Dover Publica- 
tions, Inc., New York, 1943. 


56 


Developing the concepts with special examples 


57 


that not even one whole wavelength is included. There are no longer 
any oscillations to give an interference, and velocity information is 
lost. Thus in the limit as the slit width goes to zero all velocities are 
equally likely for the particle. 


THE WAVE FUNCTION 


We have developed the amplitude for a particle to reach a particular 
point in space and time by closely following its motion in getting there. 
However, it is often useful to consider the amplitude to arrive at a 
particular place without any special discussion of previous motion. 
Thus we would say y¥(z,t) is the total amplitude to arrive at (x,t) from 
the past in some (perhaps unspecified) situation. Such an amplitude 
has the same probability characteristics as those we have already 
studied; i.e., the probability of finding the particle at the point x and 
at the time t is |y(z,t)|?._ We shall call this kind of amplitude a wave 
function. The difference between this and the amplitudes we have 
studied before is just a matter of notation. One often hears the state- 
ment: The system is in the “state” y. This is just another way of 
saying: The system is described by the wave function y. 

Thus the kernel K(22,t2;x1,t1) = W(22,te) is actually a wave function. 
It is the amplitude to get to (xo,f2). The notation K(x2,te;x1,t1) gives 
us more information, in particular, that this is the amplitude for a 
special case in which the particle came from (zıtı). Perhaps this 
information is of no interest to the problem, so that there is no point 
in keeping track of it. Then we just use the wave function notation 
(2,2). 

Since the wave function is an amplitude, it satisfies the rules for 
combination of amplitudes for events occurring in succession in time. 
Thus since Eq. (2-31) is true for all points (x1,t1), we see that the wave 
function satisfies the integral equation 


(eats) des (3-42)‏ لا(ماردستزواروته) 16 |T,‏ = (وارونة)نل 


This result can be stated in physical terms. The total amplitude 
to arrive at (22,te) [that is, ¥(22,t2)] is the sum, or integral, over all 
possible values of za of the total amplitude to arrive at the point (73,3) 
[that is, ¥(2xs,ts)] multiplied by the amplitude to go from 3 to 2 [that 
is, K(x2,t2;2s,ts)]. This means that the effects of all the past history 
of a particle can be expressed in terms of a single function. If we 
forget everything we knew about a particle except its wave function 
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at a particular time, then we can calculate everything that can happen 
to that particle after that time. All of history’s effect upon the future 
of the universe could be obtained from a single gigantic wave function. 


Problem 3-4 Suppose a free particle has a definite momentum at 
the time t = 0 (that is, the wave function is Ce’r*/^). With the help of 
Eqs. (3-3) and (3-42), show that at some later time the particle has 
the same definite momentum (i.e., the wave function depends on x 
through the function e’?*/") and varies in time as :ةلمع‎ This 
means that the particle has the definite energy p?/(2m). 


Problem 3-5 Use the results of Prob. 3-2 and Eq. (3-42) to show 
that the wave function satisfies the equation 


hoy ها‎ (3-43) 


which is the Schrédinger equation for a free particle. 


GAUSSIAN INTEGRALS 


We are finished with the physical portion of this chapter, and we now 
proceed to mathematical considerations. We shall introduce some 
additional mathematical techniques which will help us to compute 
the sum over paths in certain situations. 

The simplest path integrals are those in which all of the variables 
appear up to the second degree in an exponent. We shall call them 
gaussian integrals. In quantum mechanics this corresponds to a case 
in which the action S involves the path x(t) up to and including the 
second power. 

To illustrate how the method works in such a case, consider a par- 
ticle whose lagrangian has the form 


L = a(t)a? + b(t)jix + c(t)x? + d(x + e(t)x + f(t (3-44) 


The action is the integral of this function with respect to time between 

two fixed end points. Actually, in this form the lagrangian is a little 

more general than necessary. The factor 2 could be removed from 

those terms in which it is linear through an integration by parts, but 

this fact is unimportant for our present purpose. Wewish to determine 
2 


K(b,a) = ii exp E / 3 (bar) | 10) (3-45) 


the integral over all paths which go from (aa,ta) to (2»,ts). 
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Of course, it is possible to carry out this integral over all paths in 
the way which was first described by dividing the region into short 
time elements, and so on. That this will work follows from the fact 
that the integrand is an exponential of a quadratic form in the vari- 
ables and x. Such integrals can always be carried out. But we 
shall not go through this tedious calculation, since we can determine 
the most important characteristics of the kernel in the following 
manner. 

Let Z(t) be the classical path between the specified end points. This 
is the path which is an extremum for the action S. In the notation 
we have been using 


Salb,a] = S[z@] (3-46) 
We can represent ند‎ in terms of 2 and a new variable y: 
f= م‎ + y (3-47) 


That is to say, instead of defining a point on the path by its distance 
z(t) from an arbitrary coordinate axis, we measure instead the devia- 
tion y(t) from the classical path, as shown in Fig. 3-7. 


x 


Fig. 3-7 The difference between the classical path Z(t) and some possible alternative 
path z(t) is the function y(t). Since the paths must both reach the same end points, 
y(t.) = y(t) = 0. In between these end points y(t) can take any form. Since the classical 
path is completely fixed, any variation in the alternative path x(t) is equivalent to the asso- 
ciated variation in the difference y(t). Thus, in a path integral, the path differential Dz () 
can be replaced by Dy(t), and the path z(t) by Z(t) + y(t). In this form 2)( is a constant 
for the integration over paths. Furthermore, the new path variable y(t) is restricted to 
take the value 0 at both end points. This substitution leads to a path integral independent 


of end-point positions. 
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At each t the variables x and y differ by the constant 2. (Of course, 
this is a different constant for each value of t.) Therefore, clearly, 
dx; = dy; for each specific point t; in the subdivision of time. In gen- 
eral, we may say Da(t) = Dy(t). 

The integral for the action can be written 


Sle] = SEO + [0)ن‎ = ® laO G + ون + وقد‎ + °° Jat (348) 


If all the terms which do not involve y are collected, the resulting 
integral is just S[z(é)] = Sa. If all the terms which contain y as a 
linear factor are collected, the resulting integral vanishes. This could 
be proved by actually carrying out the integration (some integration 
by parts would be involved); however, such a calculation is unneces- 
sary, since we already know the result is true. The function Z(t) is 
determined by this very requirement. That is, 2 is so chosen that 
there is no change in S, to first order, for variations of the path around . 
All that remain are the second-order terms in y. These can be easily 
picked out, so that we can write 


S[x(] = Salb,a] + f” laO? + bay + cy] at (3-49) 


The integral over paths does not depend upon the classical path, so 
the kernel can be written 


Kea) = معفم‎ {> (exp [$ [Ë lati? + Mi + oova} ) avto 
(3-50) 


Since all paths y(t) start from and return to the point y = 0, the 
integral over paths can be a function only of times at the end points. 
This means that the kernel can be written as 


K(b,a) = eë Salba F(t, t) (8-51) 


so K is determined except for a function of t, and & In particular, 
its dependence upon the spatial variables za and a is completely worked 
out. It should be noted that the dependence of the kernel upon the 
coefficients of the linear terms d(t) and e(t) and the remaining coeffi- 
cient f(t) is also completely worked out. 

This seems to be characteristic of various methods of doing path 
integrals; a great deal can be worked out by some general methods, 
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but often a multiplying factor is not fully determined. It must be 
determined by some other known property of the solution, as, for 
example, by Eq. (2-31). 

It is interesting to note that the approximate expression K ~ e!Sa/4 
is exact for the case that S is a quadratic form. 


Problem 3-6 Since the free-particle lagrangian is a quadratic, 
show that (Prob. 2-1) 


1m(2%%— Xa)? 


K(b,a) = F (ts, ta) exp Dit — te) 


(3-52) 
and give an argument to show that F can depend only on the difference 
F(t — ta). 


Problem 3-7 Further information about F can be obtained from 
the property expressed by Eq. (2-31). First notice that the results 
of Prob. 3-6 imply that F(t, — ta) can be written as F(t), where t is 
the time interval t — ta. By using this form for F in Eq. (3-52) and 
substituting into Eq. (2-31), express F(t + s) in terms of F(t) and 
F(s), where t = ts — te and s = te — ta. Show that if F is written as 


FO = چوا‎ 10 (3-53) 
the new function f(t) must satisfy 

f(t + s) = f(Df(s) (3-54) 
This means that f(t) must be of the form 

TO =e ee: (3-55) 


where a may be complex, that is, a = a + 78. It is difficult to obtain 
more information about the function f(t) from the principles we have 
so far laid down. However, the special choice of the normalizing con- 
stant A defined in Eq. (2-21) implies that f(e) = 1 to first order in e. 
This corresponds to setting a in Eq. (3-55) equal to 0. The resulting 
value of F(t) is in agreement with Eq. (3-3). 


It is clear from this example how the important properties of path 
integrals may be easily obtained even though the integrand may be a 
complicated function. So long as the integrand is an exponential 
function which contains the path variables only up to the second 
order, a solution that will be complete except possibly for some simple 
multiplying factors can be obtained. This is true regardless of the 
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number of variables. Thus, for example, a path integral of the form 


[foo flex ,روماه‎ -200 
Delt) Dy(t) ° ° ° 2200 (3-56) 


contains as its important factor è**, where Ea is the extremum of E 
subject to the boundary conditions. The only restriction is that in 
terms of the variables x, y, and so on, E is a function of the second 
degree. The remaining factor is a function of the times at the end 
points of the paths. For most of the path integrals which we shall 
study, the important information is contained in the exponential term 
rather than in the latter factor. In fact in most cases we shall not 
even find it necessary to evaluate this latter factor. This method of 
solving path integrals will be used frequently in the succeeding chapters. 


MOTION IN A POTENTIAL FIELD 


One simple application comes in the classical limiting case in which 
the action S is very large compared to Planck’s constant 7. As we 
have already pointed out for this situation, the kernel K is approxi- 
mately proportional to e~*S:/*, We can now see more mathematically 
the basis of this approximation. Only those paths quite near to the 
classical path Z are important, so suppose we make the substitution 
x = +y. Now if the particle is moving through the potential V (x), 
we can write 


V@) = VE +y) = V( +E HE E EE 
(3-57) 


where the prime indicates differentiation with respect to x and all 
differentials are evaluated along the classical path 2. Only small 
values of y are important, so suppose V is a sufficiently smooth func- 
tion that we can neglect terms of order y3 and higher. This means 
that we assume that y*V’” and all higher-order terms are negligible 
compared to the terms kept. 

Under this assumption the integrand can be expressed as a quad- 
ratic form in y. In fact, since 2 makes S extreme, S = Sa + terms 
second order in y. The important term in the result is eS:/*, where 
now, of course, Sa contains the potential V (z) along the classical path. 
The remaining integral on y goes from 0 to 0 and is of the form of the 
last factor in Eq. (3-50). It provides a smooth function as a factor 
to eSa, 
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The result is true in situations other than the classical limiting case. 
For example, suppose V is a quadratic function of 2. Then the solu- 
tion is exact, since the expansion of V as in Eq. (3-57) contains no 
powers higher than the second. Some examples of this type are 
given in the problems. As another example, suppose V is a slowly 
varying function. In particular, if the third and higher derivatives 
are extremely small, the result given above is a very accurate 
approximation. This particular case is called the WKB approxima- 
tion in quantum mechanics. 

There are other situations in which the approximation is good. 
Suppose the total time interval for the motion is very short. If a 
particle moves along a path differing greatly from the classical path, 
it must have a very large extra velocity (to go out from the initial 
point and then return to the final point in the allotted time interval). 
The extra kinetic energy is proportional to the square of this large 
velocity, and the action contains a term roughly proportional to the 
kinetic energy multiplied by the time interval (thus, the square of the 
velocity multiplied by the time interval). The action for such paths 
will be very large, and the phase of the amplitude will vary greatly 
for closely neighboring paths. In this case again it is reasonable to 
drop the higher-order terms in the expansion of V. 


Problem 3-8 For a harmonic oscillator the lagrangian is 


re S yes i 1 (3-58) 


Show that the resulting kernel is (see Prob. 2-2) 


Hein oF [(a” + Xp”) cos wT — 27.2] (3-59) 


K = F(T) exp 


where T = t, — ta Note that the multiplicative function F(T) has 
not been explicitly worked out. It can be obtained by other means, 
and for the harmonic oscillator it is (cf. Sec. 3-7) 


F(T) = Garton)’ (3-60) 


2rth sin wT 


Problem 3-9 Find the kernel for a particle in a constant external 
field f where the lagrangian is 


D= 54 + fz (3-61) 
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The result is 


4 aa 5 
3 (zr) SSR i ae +5 IT (te + 2) — Sa | (3-62) 


where T = t, — ta. 


Problem 3-10 The lagrangian fora particle of charge e and mass m 
in a constant external magnetic field B, in the z direction, is 


5 EOE ea r = (xy — ys) (3-63) 


Show that the resulting kernel is 


4 m \*/ wT/2 imw ( (2o — Za)? 
0 وي كم‎ Pe Gs T 


+ > cot = [(t» — La)? + (ys — Ya)?] + w(zayo — a) ( (3-64) 


where T = t — ta, w = eB/me. 


Problem 3-11 Suppose the harmonic oscillator of Prob. 3-8 is 
driven by an external force f(t). The lagrangian is 


L = — t — ae 2? + f(Dz (3-65) 


Show that the resulting kernel is 


mo 1 
K= ee etl) Sa 
217 sin wT 


where 
Sa = See | (cos wT) (as? + 2a?) — Duta 
+ 22 S” f) sin w(t — ta) di 
ae pz, fo f(t) sin w(t, — t) dt 
- 2 S? [EKOO sin a(t — ©) sin o(s — ta) ds a| (3-66) 


with T = & — ta. 
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This last result is of great importance in many advanced problems. 
It has particular applications in quantum electrodynamics because 
the electromagnetic field can be represented as a set of forced harmonic 
oscillators. 


Problem 3-12 If the wave function for a harmonic oscillator is 
(at ¢ = 0) 
20) = exp | - 3? م‎ - a| (3-67) 


then, using Eq. (3-42) and the results of Prob. 3-8, show that 


¥(2,T) = exp {- LE — عبر‎ [x? — 2axe—“T 


+ Wart(1 + To (3-68) 
and find the probability distribution |y|?. 


SYSTEMS WITH MANY VARIABLES! 


Suppose a system has several degrees of freedom. A kernel for such 
a system can be represented by the form of Eq. (2-25), where the 
symbol z(t) now represents several coordinates rather than just one. 

We take as a first example a particle moving in three dimensions. 
The path is defined by giving three functions z(t), y(t), and z(t). The 
action for a free particle, for example, is 


T fE EO? + 0? + (0° dt 


The kernel to go from some initial point (2a,ya,2a) at time ta to a final 
point (2,y2,2,) at t is 
b 4 fem, d ; 
عفدا عن ار ا اك‎ 
Da(t) Dy(t) Dz(t) (8-69) 


The differential is written as Da(t) Dy(t) Dz(t). If the time is divided 
into intervals e, the position at the time t; is given by three variables 
Ti, Yi, zi and the integral over all variables is dz;, dy; dz; for each 7 in 


IR. P. Feynman, Space-Time Approach to Non-relativistic Quantum 
Mechanics, Rev. Mod. Phys., vol. 20, no. 2, p. 371, 1948. 
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an expression like Eq. (2-22). (More generally, if we represent the 
position by a vector r in some s-dimensional space, the differential at 
each point is the volume element dv; or d'r; and we can write the prod- 
uct of these for each 2 as D*r;.) 

If the definition of Eq. (2-22) is used, then the normalizing constant 
A [Eq. (2-21)] must be included for each variable in each time interval. 
Thus if the total time interval is broken up into N steps of length e, 
the factor 473¥ must be included in the integral. 

Another situation involving several variables is that of two inter- 
acting systems. Suppose one system consists of a particle of mass m 
with coordinates given by x and the other system contains a particle 
of mass M and coordinate X. Suppose these two systems interact 
through a potential V(z,X). The resulting action is 


m 


SA AON = E t? + M Ki (eX) | 01 (3-70) 
2 |i PA 2 
so that the kernel is 


K(aXutoiteXate) = f? f; (o seo xo عه(‎ oxo (3-7 


One might understand this generalization of Eq. (2-25) mathe- 
matically. Thus one might consider the motion of a point in some 
abstract two-dimensional space of coordinates 2, X. However, it is 
much easier to think of it physically as representing the motion of two 
separate particles whose coordinates are respectively x and X. Then 
K is the amplitude that the particle of mass m goes from the point in 
space-time (Lata) to (a,f) and the particle of mass M goes from 
(Xa,ta) to (X3,t%). The kernel is then the sum of an amplitude taken 
over all possible paths of both particles between their respective end 
points. The amplitude for any particular combination of paths [i.e., 
both x(t) and X(t) are specified] is e’S/*, where S is the action defined 
in Eq. (3-70). Mathematically, the amplitude is a functional of two 
independent functions z(t) and X(t), and the integral is over both of 
the variable functions. 


SEPARABLE SYSTEMS 


Suppose we have a situation in which two particles are present, both 
moving in one or perhaps more dimensions. Let the vector x repre- 
sent the coordinates of one particle and the vector X represent the 
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coordinates of the other, as in the paragraph above, except that now 
we extend the picture to a three-dimensional space. It may happen 
that the resulting action can be separated into two parts, as 


S[x,X] = S,[x] + Sx[X] (3-72) 


where S+ involves only the paths x(t) and Sx involves only the paths 
X(t). This is the situation when the two particles do not interact. 

In this case the kernel becomes the product of one factor depending 
on x and another depending on X alone. Thus 


Ee Kix): = | f (exp 000 الادة‎ ( D'x(t) D°X(t) 


= fi (exp [f Sta} ) oxo f (exp [f Sxl} ) 0ه‎ 


= 1 lr Kot) Kix ) ts Rta) (3-73) 


Here K, is the amplitude computed as if only the particle of coordi- 
nates x were present, and Kx is defined similarly. Thus in a situation 
involving two independent noninteracting systems the kernel for an 
event involving both systems is the product of two independent kernels. 
These are the kernels for each particle to carry out its individual por- 
tion of the overall event. 

The wave function in a situation involving several particles is defined 
in a straightforward manner by analogy with the corresponding kernel 
as ¥(x,X, . . . ,t). It is interpreted as the amplitude that, at time t, 
one particle is at the point x, another particle is at the point X, etc. 
The absolute square of the wave function is the probability that one 
particle is at the point x per unit volume, another particle is at the 
point X per unit volume, etc. Equation (3-42), which holds for the 
one-dimensional case, can be immediately extended to read 


een 20 خا‎ LACK COKAN 
y(x’,X’, . . . ,t') dx’ dX’ (3574) 


where dx’ is the product of as many differentials as there are coordi- 
nates in x’ space. 

In case two independent particles are represented by the sets of 
coordinates x and X, then the kernel K is the product of one function 
of x and t and another of X and t, as mentioned above. However, this 
does not imply that, in general, y is such a product. In the special 
case that y is at some particular time a product of a function of x 
and another of X [thus y = f(x) ° g(X)], then it will remain so for all 
time. Each factor will change as it would for the partial system alone, 
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since the kernel K represents the independent motion of two particles. 
But this is a special case. Just because the particles are independent 
now does not mean that they always were. There may have been 
some interaction in the past, which would imply that y is not a simple 
product. 

Even though the action S does not appear as a simple product in 
the original coordinate system, there is often a transformation (such 
as that of center-of-gravity and internal coordinates) which will make 
it separable. Since the same form for the action is used in quantum 
mechanics as in classical physics, any transformation which will sepa- 
rate a classical system will also separate the corresponding quantum- 
mechanical system. Thus a part of the great body of work in classical 
physics can be applied directly to quantum mechanics. Such trans- 
formations are very important. It is hard to deal with a system con- 
sisting of several variables. Separation of variables permits us to 
reduce a complex problem to a number of simpler problems. 


THE PATH INTEGRAL AS A FUNCTIONAL 


When a problem contains more than one variable and a separation is 
not possible, the analysis is generally very difficult. Later on we shall 
discuss some approximations which can be applied in this case. Here 
we shall describe one very powerful tool which can sometimes be 
applied. Consider the kernel given by Eq. (3-71). This can be 
written out in detail as 


Koa = J? f fox | و‎ f Barat Û f ¥ xa 


(i a. V(a,X,0) a|} D»() DX(t) (3-75) 


First, suppose we carry out the integral over the paths X(t). The 
result can be written formally as 


K(b,a) = f; È é i! “oa at) | T{x(t)] Delt) (3-76) 


Tied) = f (exp 6 ie E x4 v@,x,0 ||) dtDX(t) (3-77) 


These results are interpreted in the following manner. Integrating 
over all paths available to the X particle produces a functional T. A 
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functional is a number whose value depends on specifying a complete 
function. For example, the area under a curve is a functional of the 
curve A = ff(y) dy. To find it, a function (the curve) must be 
specified. We write a functional as ]ل‎ f(y)] to indicate that A depends 
on the function f(y). We do not write A( f(y)), for that might be 
interpreted as a function of a function, i.e., that A just depends on 
what value f takes at some specified point y. This is not the case. 
A[f(y)] depends on the entire shape of the function f(y). It does not 
depend on y in any way. 

The functional defined in Eq. (3-77) is the amplitude that the X par- 
ticle alone goes between its end points X, and X, under the influence 
of a potential V. This potential, which depends upon both x and X ; 
is computed assuming z is held to be a fixed path as X changes. Thus 
it is the potential for the X particle when the z particle is moving along 
a specific trajectory. Clearly, this amplitude 7 depends upon the tra- 
jectory chosen for z(t), so we write it as a functional of z(t). Then 
the total amplitude is obtained by summing over all paths a func- 
tional consisting of the product of 7 and the free-particle kernel 
for x(t). 

Thus the amplitude K, like all others, is a sum over the amplitudes 
of all possible alternatives. Each of these amplitudes is a product of 
two lesser amplitudes. The first of these is the amplitude T that the 
X particle goes between its given end points when x has a specified 
trajectory. The second is the amplitude that x has that specified 
trajectory. The final sum over alternatives becomes the sum over 
all possible trajectories of x. It is important to understand this con- 
cept clearly, for it includes one of the fundamental principles of 
quantum electrodynamics, a subject which will be taken up in a later 
chapter. 

Of course it is not practical to use this method unless the integral 
T can actually be worked out, either exactly or approximately, for 
the possible values of the trajectory z(t). As we have seen (cf. Prob. 
3-11) one exact case is that in which X is a harmonic oscillator. This 
is a very important practical case. For example, when a particle 
interacts with a quantized field, the field is an oscillator. 


INTERACTION OF A PARTICLE AND 
A HARMONIC OSCILLATOR 


We shall consider in more detail the interaction of a particle and a 
harmonic oscillator. Let the coordinates of the particle be x and 
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those of the oscillator be X. The action can be written as 


S[x,X] = Solx] + ” JOZO) dt + Í, 1 M (ke + oX) dt )8-78( 


2 

where So is the action of the particle in the absence of the oscillator. 
In the discussion above we assumed that this action corresponded to 
that for a free particle. This assumption is not necessary. The 
motion of x could be complicated by the existence of a potential 
depending upon x and ¢ only. Thus, for example, the action So 
might be 


Sox] = fe 0 x? — Vea | dt (3-79) 

The second term in Eq. (3-78) represents the interaction between 
the particle and the oscillator. Note that this term is linear in X. 
Omission of a dependence upon X does not imply any loss in gener- 
ality, since if such a term were to occur, it could be removed by an 
integration by parts. We can call the coefficient g the coupling 
coefficient. Its dependence upon x(t) is indicated, but it could also 
depend upon other variables, such as x(t). Since the analysis we are 
presenting is general, it is not important to write down the exact form 
of g. The last term in Eq. (3-78) is, of course, the action of the oscil- 
lator alone. By combining this with the second term, the function T 
of Eq. (3-77) can be written as 


T\x(t)] = ie | exp G ji : (z (KX? — w2X2) 
T gix, a) | DX(t) (3-80) 


Now as far as X is concerned, the situation is just that of a forced 
harmonic oscillator. The forcing function g[x(é),t] is some special 
function of t, say, f(t). Thus the path integral is the same as that con- 
sidered in Prob. 3-11, with f(t) replaced by g[x(¢),¢] and the initial and 
final coordinate values (7,za) replaced by (X»,Xa). 

For illustrative purposes, to simplify the expressions somewhat, we 
take the special case in which the oscillator initially and finally is at 
the origin, so X, = X, = 0 (the general case is just as easily handled). 
Then according to Prob. 3-11 in this case we have 


= MW 4 0 to t 
oF 6 sin T) oH 5-5 sin wT if j glx(2),églx(s),s] 


sin w(t, — t) sin w(s — ta) ds a (3-81) 
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Therefore, the kernel for the present situation can be written 


2 mw % fb m fa. 
كات‎ 0 sin a J. (=p; k [xO at 


1 to t $ 
r VE ji g[x(t),t]g[x(s),s] sin w(ta — t) 


sin w(s — ta) ds |) Dx(t) (3-82) 


with a similar (but more complicated) expression for arbitrary روك‎ Ko 

This is a more complicated path integral than any we have had to 
solve so far. It is not possible to proceed further with the solution 
until various methods of approximation have been developed in suc- 
ceeding chapters. Note that the integrand of this path integral can 
still be thought of as being of the form es, but now S is no longer a 
function of only رغ‎ x, and 4. Instead, S contains a product of variables 
defined at two different times, s and t. The separation of past and 
future can no longer be made. This happens because the variable x 
at some previous time affects the oscillator which, at some later time, 
reacts back to affect x. No wave function (x,t) can be defined to 
give the amplitude that the particle is at some particular place x at 
a particular time ¢. Such an amplitude would be insufficient for con- 
tinuing calculations into the future, since at any time one must also 
know what the oscillator is doing. 


EVALUATION OF PATH INTEGRALS BY FOURIER SERIES 


Consider the path integral for the harmonic oscillator problem (Prob. 
3-8). This is 


K(b,a) = ‘My (exp f f, 8 E (42 — w%x?) a!) 0:00 (3-83) 


Using the methods of Sec. 3-5 this path integral can be reduced to a 
product of two functions, as in Prob. 3-8. The more important of 
these two functions depends upon the classical orbit for a harmonic. 
oscillator and is given in Eq. (3-59). The remaining function depends 
upon the time interval only and is written down in Eq. (3-60). This 
latter function can be written as 


few [5 f" $ o? - مد }| حت‎ (3-84) 


4 


F(T) = | 


0 


3-1 
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We shall solve this, at least to within a factor independent of w, by a 
method which illustrates still another way of handling path integrals. 
Since all paths y(t) go from 0 at t = 0 to 0 at t = T, such paths can 
be written as a Fourier sine series with a fundamental period of T. 
Thus 


yt) = J ansin شلك‎ (3-85) 


n 


It is possible then to consider the paths as functions of the coeff- 
cients of a, instead of functions of y at any particular value of t. This 
is a linear transformation whose jacobian J is a constant, obviously 
independent of w, m, and f. 

Of course, it is possible to evaluate this jacobian directly. However, 
here we shall avoid the evaluation of J by collecting all factors which 
are independent of w (including J) into a single constant factor. We 
can always recover the correct factor at the end, since we know the 
value for w = 0, F(T) = m/2mihT (a free particle). 

The integral for the action can be written in terms of the Fourier 
series of Eq. (3-85). Thus the kinetic-energy term becomes 


r ıa ون‎ = SV em aa cos Ê cos mat 
1 y? dt 2 T T ا‎ COS r COS A dt 
= TAL NS 
AF 5 (¥) dn )3-86( 
and similarly the potential-energy term is 
1 1 
j vate 2020 an (3-87) 


On the assumption that the time T is divided into discrete steps of 
length ء‎ as for Eq. (2-19) so that there are only a finite number N of 
coefficients an, the path integral becomes 


2 
ORO os 


Since the exponent can be separated into factors, the integral over 
each coefficient a, can be done separately. The result of one such 


72 


Developing the concepts with special examples 
73 


integration is 


= ١ 22 22 -14 
ع ]سا‎ Aee om 


Thus the path عون‎ is proportional to 
N . N 
nr? nar? w?T2\- 
I] Cor 7 a)” = 1 (ar =). I] 6 3 = (3-90) 
R=] n=1 


The first product does not depend on w and combines with the 
jacobian and other factors we have collected into a single constant. 
The second factor has the limit [(sin wT)/wT] * as N جح‎ œ, that is, 
ase— 0. Thus 


F(T) =C (Sr) " (3-91) 


where C is independent of w. But for w = 0 our integral is that for a 
free particle, for which we have already found that 


% 
ام‎ See aT) 00 
Hence for the harmonic oscillator we have 
mw J4 
ا‎ Card) (3-93) 


which is to be substituted in Eq. (3-59) to obtain the complete solution. 


Problem 3-13 By keeping track of all the constants, show that 
the implication is that the jacobian satisfies 


N 
N 
v7 (2) II 21 as N ج‎ œ (3-94) 


n=1 
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The Schrodinger 
Description of 
Quantum Mechanics 


Tur path integrals which we have discussed so far [with the exception 
of Eq. (3-82)] have integrands which are exponentials of actions with 
the property 


S[2,1] = S[2,3] + S[3,1] (4-1) 


Such path integrals can be analyzed in terms of the properties of inte- 
gral equations which can be deduced from them. We have already 
seen this in Chap. 2 [e.g., Eq. (2-31)] and Chap. 3 [e.g., Eq. (3-42)]. 

A still more convenient method is to reduce the path integrals to 
differential equations if possible. This possibility exists in quantum 
mechanics and is, in fact, the most convenient way to present that 
theory. It is in almost every case easier to solve the differential equa- 
tion than it is to evaluate the path integral directly. The conven- 
tional presentation of quantum mechanics is based on this differential 
equation, called the Schrédinger equation. Here we shall derive this 
equation from our formulation. We shall not solve this equation for 
a large number of examples, because such solutions are presented in a 
detailed and satisfactory fashion in other books on quantum mechanics. ! 

In this chapter our purpose is twofold: (1) For the reader primarily 
interested in quantum mechanics our aim is to connect the path inte- 
gral formulation with other formulations which are found in the stand- 
ard literature and textbooks so that he can continue his study in those 
books and can learn to translate back and forth between the two dif- 
ferent languages. (2) For the reader primarily interested in path 
integrals this chapter will show a technique which is available for a 
certain class of path integrals to reduce these path integrals to differ- 
ential equations. This technique is best shown by the particular 
example of quantum mechanics which we shall develop here. 


THE SCHRODINGER EQUATION 


The Differential Equation Form. The reason that we can 
develop a differential equation is that the relationship of Eq. (4-1) is 
correct for any values of the points 1, 2, and 3. For example, the 
time tł: can be only an infinitesimal time e greater than the time ts. 
This will permit us to relate the value of a path integral for one time 
to its value a short time later. In this manner we can obtain a differ- 
ential equation for the path integral. 


1 For example, see L. I. Schiff, “Quantum Mechanics,” 2d ed., McGraw-Hill 
Book Company, New York, 1955. 
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We have already found that as a consequence of Eq. (4-1) we can 
define a wave function. Furthermore, we know that the equation 


Plent) = f (اردة) لا(داردضتزواروتة) ك1‎ des (4-2) 


gives the wave function at a time t in terms of the wave function at 
a time tı. In order to obtain the differential equation that we seek, 
we apply this relationship in the special case that the time tz differs 
only by an infinitesimal interval e from tı. The kernel K(2,1) is pro- 
portional to the exponential of î/ times the action for the interval 
tı to tz. For a short interval e the action is approximately e times the 
lagrangian for this interval. That is, using the same approximation 
as that of Eq. (2-34), we have 


Wott) = fen |< aay «43 


€ 


We shall now apply this to the special case of a particle moving in 
a potential V(z,t) in one dimension, i.e., that for which L = 7222/2 — 
V(z,t). In this case Eq. (4-3) becomes 


yz, t+ e) = / | Poel 


{exp | 2 sev ٤ a 4 |) v(y,t) dy (4-4) 


The quantity (x — y)?/e appears in the exponent of the first factor. 
It is clear that if y is appreciably different from 7, this quantity is very 
large and the exponential consequently oscillates very rapidly as y 
varies. When this factor oscillates rapidly, the integral over y gives 
a very small value (because of the smooth behavior of the other factors). 
Only if y is near x (where the exponential changes more slowly) do we 
get important contributions. For this reason we make the substitu- 
tion y = x + 7 with the expectation that appreciable contributions to 
the integral will occur only for small n. We obtain 


v(x, i de é) A iz 5 eimen? heg lieli) Vata) | 3,41 (x 00 n, t) dn (4-5) 


The phase of the first exponential changes by the order of 1 radian 
when 7 is of the order ~/ch/m, so that most of the integral is contributed 
by values of 7 in this order. 

We may expand y in a power series. We need only keep terms of 
order e This implies keeping second-order terms in 7. The term 
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eV[(x + n)/2, t] may be replaced by eV (x,t) because the error is of 
higher order than e. Expanding the left-hand side to first order in > 
and the right-hand side to first order in e and second order in n, we 
obtain 


ve 


ylz t) + ع‎ L. 1 gimme E A 5 væn | 


x [ve 9 + كةو‎ +s =| dn (4-6) 


If we take the leading term on the right-hand side, we have the quan- 
tity Y(z,t) multiplied by the integral 


1 f=! decease و‎ 
reece: م‎ =1 0 


On the left-hand side we have just y¥(z,t). In order that both sides 
agree in the limit e approaches 0, it is necessary that A be so chosen 
that the expression of Eq. (4-7) equals 1. That is, 


A = (Pane) (48) 


as we have seen previously [cf. Eq. (2-21)]. This is a way of obtaining 
the quantity A in more complicated problems also. The A must be 
so chosen that the equation is correct to zero order in e. Otherwise, 
no limit will exist as e approaches 0 in the original path integral. 

In order to evaluate the right-hand side of Eq. (4-6), we shall have 
to use two integrals 


ie p Emnin dn = 0 (4-9) 
and 

afer! 1 
i ies a em ne2 dn = = (4-10) 


Writing out the right-hand side of Eq. (4-6) gives 


oy he 0 


yt =y- A الم‎ e r (4-11) 


This will be true to order e if Y satisfies the differential equation 


O 
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This is the Schrédinger equation for our problem of a particle moving 
m one dimension. Corresponding equations in more complicated situ- 


ations can be worked out in the same way, as demonstrated by the 
following problems. 


Problem 4-1 Show that for a single particle moving in three 
dimensions in a potential V the Schrédinger equation is 


2 
E OVE (4-13) 


This equation was discovered by Schrédinger in 1925 and formed the 
central feature of the development of quantum mechanics thereafter. 


The Operator Form. The equations which result from various 
problems corresponding to different forms for the lagrangian can all 
be written for convenience in the form 


Here H does not represent a number but indicates an operation on ¥. 
It is called the hamiltonian operator. For example, in Eq. (4-12) 
this operation is 


ARAE AE (4-15) 


Such an equation with operators on both sides means this: If any 
function f is written after each operator on each side, the equation 
will be true. That is, Eq. (4-15) symbolizes the statement: The 
relation 


h? of 


ii ا اك‎ a (4-16) 


holds for any function f. 


Problem 4-2 Show that for a charged particle in a magnetic field 
for which the lagrangian is 


L= + ota eb (4-17) 


where ¢ is the velocity vector, e is the charge, c is the velocity of light, 
and A and ¢ are the vector and scalar potentials, the corresponding 
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Schrédinger equation is 


hoy 1 h age : 7 wie 
-iF - <A) (Fy A) y + بوه‎ (4-18) 
Thus the hamiltonian is l 

LEIR e 7 6 


Problem 4-3 Show that the complex conjugate function y*, 
defined as the function Y with every i changed to —1, satisfies 


Pap TU (4-20) 


The notation for operators can be described by giving a number of 
examples. For example, the operator x means multiplication by ® 
the operator x? means multiplication by 2?, the operator V(x) (some 
function of z) means multiplication by V(x), the operator d/dz means 
partial differentiation with respect to x, dy/dz, ete. 

If A and B are operators, then the operator AB means that we first 
apply B and then A, that is, ABy means A(By). Thus, for example, 
the operator x(@/dx) means z times d¥/dz. On the other hand, the 
operator (0/dx)xz means the partial derivative with respect to x of xy, 
or (0/dx)(xp) = x(dp/dx) + y. 

We see that in general the operator AB and the operator BA are 
not identical. We further define the operator A + B by the rule that 
A + B operating on y is Ay + By. For example, the previous equa- 
tion can be written as an equation among operators as follows: 


8 8 
ap on eg a (4-21) 


the meaning being that (0/dx)af = x(d/ dx)f + f for any function f. 


Problem 4-4 Show 


agit oe E (4-22) 


and therefore that, for the H of Eq. (4-15), 


h? ð 
Hzr — zH = — aE (4-23) 


This operator notation is used a great deal in the conventional 
formulations of quantum mechanics. 
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The Schrodinger Equation for the Kernel. Since K (231); 
thought of as a function of the variables 2, is a special wave function 
(namely, that for a particle which starts at 1), we see that K must 
also satisfy a Schrédinger equation. Thus for the case specified by 


Eq. (4-15) 

ha ñ? 9? 

7 a, ED د‎ agp K )2:1( + V(2)K(2,1) (4-24) 
for t > tı. In general we have 

fi dK(2,1 
= a - = H:K(2,1) =0 for to > ty (4-25) 


wherein the operator H» operates on the variables 2 only. 
Problem 4-5 Using the relation 
K(2,1) = |” K(2,3)K(3,1) dzs (4-26) 


with ts — tı = e, an infinitesimal, show that if t» is greater than tı, the 
kernel K satisfies 


site 


K(2,1) — H*K(2,1) = 0 (4-27) 
0 Ot; 


where H, now operates on variables 1 only. 


The function K(2,1) defined by a path integral in Eq. (2-25) is 
defined only for ts > tı. The function is not defined if t < t It 
will prove to be very convenient in later work (e.g., Chap. 6) to define 
K(2,1) to be zero for tz > tı. [With this convention Eq. (4-2), for 
example, is valid only if t > tı.] With the condition 


K(2,1)=0 fort<t (4-28) 


it is evident that Eq. (4-25) is satisfied also for t, < tı (in a trivial 
fashion, of course, since K = 0). But this equation is not satisfied 
at the point t = tı because K(2,1) is discontinuous at t: = tı. 


Problem 4-6 Show that K(2,1) جح‎ ô(x: — xı) as t: > tı + 0. 


From the result of Prob. 4-6 we see that the derivative of K with 
respect to ts gives a delta function in the time multiplied by the height 
of the jump, 6(z2 — zı). Hence K(2,1) satisfies 


_ ECD _ HK(2,1) = — Ê a(r — Dôl - h) (4-29) 
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This equation plus the boundary condition of Eq. (4-28) could serve 
to define K(2,1) if one were to have started out from the Schrédinger 
equation as the fundamental definition in quantum mechanics. It is 
clear that the quantity K(2,1) is a kind of Green’s function for the 
Schrédinger equation. 


The Conservation of Probability. The hamiltonian operator 
given by Eq. (4-15) has the interesting property that, if f, g are any 
functions which fall off to zero at infinity, 


J. Hof ax = f7, g*Hf) da (4-30) 


The meaning of the symbols is this. On the left we are to take g, 
operate on it with H (forming Hg), and then take the complex conjugate. 
The result is then multiplied by f and integrated over all space. The 
result is the same as taking Hf, multiplying by the complex conjugate 
of g, and integrating. It is easily verified that this is true by inte- 
grating the term f(Hg)*f dx (by parts, where necessary). 

For our example in Eq. (4-15) we have for the left side of Eq. (4-30) 


Oh fo dg e ار ل‎ N 
J مه اجر‎ + ] Va"f dx = (1-o HE 
h? d? 3 
-i> 0 SS ax ep he Vg*fdx (4-31) 


(integrating by parts twice). If f, g fall off at infinity, the integrated 
parts vanish and Eq. (4-30) is established. An operator which has the 
property given by Eq. (4-30) is called hermitian. In all cases of 
quantum mechanics the hamiltonian is hermitian. For more general 
cases than that considered above the integration over our one-dimen- 
sional variable x becomes an integration (or sum) over all the variables 
of the system. 
If we put f and g equal to y(x,t), we get 


J(HV)*Y dx = fy*(Hy) dx (4-32) 
and if y satisfies the wave equation (4-14), this becomes 
9۷۳ پ‎ da ده‎ ] Ede = ؟‎ ) ] yya) =0 (4-33) 


That is, Jy*y dz is a constant independent of time.. This is easily 
interpreted. For if y is suitably normalized, y*y is the probability 
of being found at x; so the integral is the probability of being found 
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somewhere, which is certainty (or 1) and is constant. Of course, as 
far as the wave equation is concerned ¥ can be multiplied by any con- 
stant and still be a solution. Then y*y is multiplied by the square of 
the constant, and the integral is this constant squared. 

It is fundamental to our definition of y as probability amplitude 
that the integral of y*y is constant. In terms of the kernel this means 
that if f is the wave function at time tı, then at time t it has the same 
square integral. That is, if 


¥(2) = fK(2,1)f(1) da, (4-34) 
S¥*(2)¥(2) dae = ff*(1)f(D dz (4-35) 
or 


SJSK* (25a) K (2501, f*(@)) f(a) dar dzi dza 

= Jf*(zDf(z da, (4-36) 
For this to be true for arbitrary f we must have 
fK*(232},t1)K(2;21,t)) dzs = 8)24 — 21) (4-37) 


That is, in order to interpret Y as a probability amplitude, the kernel 
must satisfy Eq. (4-37). We have derived this by means of the 
Schrédinger equation. It would be nicer to demonstrate this and 
other properties, such as Eq. (4-38) and Prob. 4-7 , directly in terms 
of the path integral definition of K instead of coming through the dif- 
ferential equation. It is possible, of course, but it is not so simple or 
neat as a derivation of such a fundamental relation should be. One 
can verify Eq. (4-37) as follows: For a small interval with tı = t — 3 
Eq. (4-37) follows directly from the expression e‘“/* for this interval. 
By induction, the complete Eq. (4-37) results. One disadvantage ot 
the approach to quantum mechanics through the path integrals is the 
fact that relations involving y* or K* are not self-evident. 

By multiplying Eq. (4-37) by K(1,3) and integrating on zı, we can 
show 


fK*(2,1)K(2,3) dza = K(1,3) (4-38) 


where tz > tı > ts. Compare this to [K(1,2)K(2,3) dz: = K(1,3) for 
tı > ls > ts. We may describe the second relation this way: Starting 
at ta, K(2,3) gives us the amplitude at the later time ta. If we wish 
to go to a still later time tı, we can do so by using the kernel K(1,2). 
On the other hand, if having the amplitude at ts we want to work back 
to find it at an earlier time tı < tz, we can do this by using the func- 
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tion K*(2,1) according to Eq. (4-38). That is, we can say K*(2,1) 
undoes the work of K(1,2). 


Problem 4-7 Show that if tı > ts, the left-hand side of Eq. (4-38) 
equals K*(3,1). 


THE TIME - INDEPENDENT HAMILTONIAN 


Steady States of Definite Energy. The special case that the 
hamiltonian H is independent of time is of great practical importance. 
This corresponds to the case that the action S does not depend on the 
time explicitly; i.e., the potentials A and V, for example, do not con- 
tain t. In this case the kernel cannot depend upon the absolute time 
but instead is a function only of the interval t — tı. Asa consequence 
of this fact, there exist wave functions that depend periodically on 
the time. 

It is easiest to see what happens by studying the differential equation. 
Starting from the Schrédinger equation (4-14), we try a special solu- 
tion of the form y = f(t) ¢(x), a function of a time only multiplied by a 
function of position only. Substitution gives us the relation 


- VOH) = Hi) = 1011) (4-39) 

or 

SANE : 
ry ge (4-40) 


The left-hand side of this equation does not depend upon x, whereas 
the right-hand side is independent of t. If they are always equal, 
neither side can depend upon either variable t or 2. That is, each side 
must bea constant. Let us call this constant E. Thenf’ = —(i/h)Ef, 
or f = e ‘#/* with an arbitrary constant factor. Thus the special solu- 
tion is of the form 


y(x, t) = eGD Eth (x) (4-41) 
where ¢ satisfies 
Ho = Eo (4-42) 


That is, for this special solution the wave function oscillates with 
a definite frequency. We saw that the frequency with which a wave 
function oscillates corresponds, in classical physics, to the energy. 
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Therefore, we say that when the wave function is of this special form, 
the state has a definite energy E. For each value of E a different 
particular function ¢ [a solution of Eq. (4-42)] must be sought. 

The probability that a particle is at x is the absolute square of the 
wave function ¥, or |Y|?2. In view of Eq. (4-41) this is equal to ||? 
and does not depend upon the time. That is, the probability of find- 
ing the particle in any location is independent of the time. We say 
under these circumstances that the system is in a stationary state— 
stationary in the sense that there is no variation in the probabilities 
as a function of time. 

This situation is somewhat related to the uncertainty principle; for 
in a situation in which we know that the energy is exactly E we must 
be completely uncertain of the time. This is consonant with the idea 
that the properties of an atom in a specific state are absolutely inde- 
pendent of the time, so that at any time we would obtain the same 
result. 

Suppose that F, is a possible energy for which Eq. (4-42) has a solu- 
tion رف‎ and that E» is another value for energy for which this equation 
has some other solution ¢2. Then we know two special solutions of 
the Schrédinger equation, namely, 


Vi = Eths) and Ya = eH Etgo(z) (4-43) 


Since the Schrödinger equation is linear, it is clear that if y is a solu- 
tion, so is cy. Furthermore, if yı is a solution and ولا‎ is a solution, 
then the sum of yı and y is also a solution. Evidently, then, the 
function 


Y = ce GMB (22) + Cpe BG (2) (4-44) 


is also a solution of the Schrédinger equation. 

As a matter of fact, it can be shown that if all of the possible values 
of E and the corresponding functions ¢ are worked out, any solution 
of Y of Eq. (4-14) can be written as a linear combination of these special 
solutions of definite energy. 

The total probability to be anywhere is constant, as we showed in 
Sec. 4-1. This must be true no matter what the values of cı and C2, 
so that, using Eq. (4-44) for y, we have 


dr‏ وب dx + cfcre4™ EED f pf‏ 2يف | dx = ccf‏ للبلا[ 
)4-45( ندل وو ciecze NEED f pip dr + ccf o}‏ + 


Since this must give a constant result, the time-variable terms (i.e., 
terms including e*©/*(#:—2»)t) must vanish for all possible choices of ره‎ 
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and cz. This means 
| ۵ = |", ot dx = 0 (4-46) 


Where two functions f, g satisfy [f*g dx = 0, we say they are orthogonal. 
Thus Eq. (4-46) says that two states of different energy are orthogonal. 

Later we shall learn an interpretation for expressions such as ff*g dz, 
and we shall find that Eq. (4-46) records the fact that if a particle is 
known to have an energy E (and hence a wave function yı = e”"/*g)), 
then the amplitude that it is found to have a different energy Fz (i.e., 
wave function e*”:/*g.) must be 0. 


Problem 4-8 Show from the fact that H is hermitian that E is 
real. [Choose f = g = ¢ in Eq. (4-30).] 


Problem 4-9 Show from the fact that H is hermitian that Eq. 
(4-46) holds. [Choose f = ¢2, g = ¢, in Eq. )4-30(.[ 


Linear Combinations of Steady-state Functions. Suppose 
that our functions corresponding to the set of energy levels E, are 
not only orthogonal but also normalized, i.e., that the integral of the 
absolute square over all x is 1. Then we shall have 


fE, Ebal) dz = bam (4-47) 


where ônm, the Kronecker delta, is defined by ônm = 0 if n كع‎ m and 
dun = 1. Many functions can be expressed as a linear combination 
of such ¢,’s. In particular, any function which is likely to arise as 
a wave function can be so expressed. That is, 


oo 


f(z) = Y, angna) (4-48) 


n=1 


The coefficients a, are easily obtained; for by multiplication of Eq. 
(4-48) by the quantity ¢*(x) and integration over all x we obtain 


T “OAS (a) de = > dn | _ 00 = Om (4-49) 


n=l 


That is, 


an = | AOSE) de (4-50) 
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Thus we have the identity 


Ha) = Y (® f, OI) dy 
n=1 


= JS Oem] dy 45D 


n=1 


An interesting alternative way of expressing this result makes use of 
the definition of the delta function. We write 


az=) = ) d(x) 62(y) (4-52) 


It is possible to express the kernel K in terms of these functions ¢, 
and the energy values E,. We do so by the following consideration. 
Let us ask this: If f(x) is the known wave function at the time tı, what 
is the wave function at the time t? It can be written at any time t as 


(zy) = X cre Bate, (x) (4-53) 


n=1 


for it is a solution of the Schrödinger equation, and any solution can 
be written in this form. But at the time t, we have 


1a) = ¥(2,t1) = 2 وين‎ I Ents, (:ه)‎ = > Anon(x) (4-54) 
n=1 n=l 

since we can always express f(x) in the form of Eq. (4-48). So we 

conclude 

Ca Anet UA) Ent (4-55) 


Putting this into Eq. (4-53), we have 


¥(2,b2) = > Cre WM Enton (x) = 0 ane GD Bat) py (2) (4-56) 


n=1 n—-1 


Now using Eq. (4-50) for the coefficient an, we obtain 


V(t) = X da(aeomante | GUSU) dy 


=1 


= i 5 » bn(Z) أن‎ (y)e EEn t f(y) dy (4-57) 
n=1 
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This final expression determines the wave function at the time fe 
completely in terms of f(x), the wave function at the time tı. Previ- 
ously we represented this relation by the equation 


plet) = |, Ketau tdf) dy (4-58) 


Comparing this to Eq. (4-57), we finally obtain the desired expression 
for the kernel K(2,1), 


K (x2,t2;£1,t1) 


0 رفو‎ (x2) p} (£1) (3لة)‎ Ena) for tg > t 
n=1 
= 0 for ta < tı (4-59) 


Problem 4-10 Verify that K as defined in Eq. (4-59) satisfies the 
Schrédinger equation (4-29). 


This expression for K is very useful for translating expressions to 
more conventional representations. It expresses the kernel, which 
was originally a path integral, entirely in terms of solutions of the 
differential equation (4-42). 


Problem 4-11 Show that for free particles in three dimensions the 
solutions 


bp = eter (4-60) 


go with the energy Ep = p?/2m. Consider the vector p as an index 
n and note the orthogonality. That is, as long as p ¥ p’, 


| jey dîr =0 evenif E, = By (4-61) 
Therefore the free-particle kernel must be 


K (£2, te;f1,t1) = >. سق‎ GIR) 8 (tr) oip? (tt) | 2m (4-62) 
P 


Since the p’s are distributed over a continuum, the sum over the 
bbe e وو‎ 5 i 5 
indices” م‎ is really equivalent to an integral over the values of p, 


namely, 

2 dp 
0 قن‎ 0 C) Gray )4-63( 
Therefore, we find that the free-particle kernel is given by 


dp 


Ko(rs,tejrı,t1) = 1 P eGD م١‎ amr *صخس‎ t) mh SP 
(2rh)* 


(4-64) 
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Problem 4-12 Carry out the integral in Eq. (4-64) by completing 
the square. Show that the form of the free-particle kernel results 
li.e., the three-dimensional version of Eq. (3-3)]. 


NORMALIZING THE FREE - PARTICLE WAVE FUNCTIONS 


The derivation of the kernel for a free particle, as given in Prob. 4-11, 
is unsatisfactory for two related reasons. First, the idea of a sum 
over distinct states n used in Eq. (4-62) is not satisfactory if the states 
lie in a continuum, as they do for a free particle where any p is allowed. 
Second, the plane-wave functions for free particles, although orthog- 


onal, cannot be normalized, that is, / 0 5 dx = Í 4 ا‎ 6 50 
the condition of Eq. (4-47) used in deriving Eq. (4-62) is not satisfied. 
Both of these points can be remedied together in a perfectly straight- 
forward mathematical way. Starting all the way back when we 
expressed an arbitrary function as a sum of eigenfunctions, 


f(z) = أ(‎ angn(z) (4-65) 


we allow part, or all, of the states to lie in a continuum, so that the 
sum on n must be replaced partly by an integral. With mathematical 
care one can find the correct expression for K analogous to Eq. (4-62) 
but applying also when the states are in a continuum. 


Normalizing in a Box. Many physicists prefer another, less 
rigorous approach. What they do is modify the original problem in 
a way that (from physical reasoning) will not essentially modify the 
result yet will leave all the states separate in energy and all the simple 
sums as simple sums. In our example this may be accomplished as 
follows. We are studying the amplitude that in a finite time a par- 
ticle goes from zı at time tı to xz at time te. Now if these two points 
are some finite distance apart and the time is not extremely long, 
surely it can make no appreciable difference to the amplitude whether 
the electron is really free or is instead confined to some enormous box 
of volume V with walls very, very far from x; and .ون‎ The amplitude 
could be affected only if the particle could run out to the walls and 
back in the time وخ‎ — tı; but if the walls are far enough away, there 
is no appreciable amplitude for this. 

It is always possible that this assumption fails for some special- 
shaped walls such that, for example, xz is at a focus of waves from 1 
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reflected at the walls. From time to time someone lets an error creep 
in by replacing a system in empty space with one at the center of a 
large spherical box. The fact that the system remains at the exact 
center of a perfect sphere may have an effect (like the spot of light at 
the center of the shadow of a perfectly circular object) which does not 
vanish as the sphere radius goes to infinity. For another shape, or a 
system off-center to the sphere, the surface effect would vanish. 

Take first the case of one dimension. In empty space the space- 
dependent wave functions are e‘?7/* (any p, positive or negative). If, 
instead, the range of z is limited to —L/2 to +L/2, say, what are the 
functions ¢? The answer depends on the boundary conditions defin- 
ing datz = —L/2 and x = +L/2. The easiest conditions to under- 
stand physically are those for walls which offer very high repulsive 
potentials to the particle, thus confining it (i.e., perfect reflectors). 
They correspond to (x) = 0 at x = —L/2 and x = +L/2. The 
solutions of the wave equation 


2 a? 
- 55 S EE (4-66) 


in the range |x| > L/2 are, for E = p?/2m = h*k?/2m, 
ekz and e` ikz 


or any linear combination. Neither e** nor e~** can satisfy the bound- 
ary conditions, but with k = nr/L (n an integer) satisfactory solutions 
are given by half the sum (which is cos kx) for n odd and 147 times 
the difference (which is sin kx) for n even, as diagramed in Fig. 4-1. 
Thus the states are sines and cosines and the energy levels are separated 
(i.e., not in a continuum). 

If the solutions are written as 


2 Dae 
4p cos kz and 1 sin kz 


they are normalized, since 


L/2 2 co ae 
1 بدا‎ 60 008 kz) dz = 1 (4-67) 


A sum over states is a sum over n. If we consider, say, the sine wave 
functions (thus, even values of n) for very large L but not large x 
(walls far from the point of interest), the successive functions differ 
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Fig. 4-1 The form of the 
one-dimensional wave func- 
tions which have been nor- 
malized in a box. The first 
four are shown. The corre- 
sponding energy levels are 
E, = h?r?/2mL?, E, = 4Bı, 
E; = 9E, and E, = 1671 
The absolute magnitude of 
the energy, which depends on 
the size of our fictitious box, 
is not important for more 
realistic problems. Rather, 
it is the relation between the 
energy levels of the various 
states which has significance. 


by only a small amount. This difference 


eH ye 32 : a 2 MEE T 
HE 2r(n + Dz — sin aan | = 2 41? cos 2r —y— ع‎ sin 2r oF 


egy bel خف‎ (n = )z (4-68) 


is approximately proportional to the small quantity x/L. So a sum 
on n can be replaced by an integral over k = 2rn/L. Since the suc- 
cessive allowed values of k (for sine functions) are spaced by 27/L, 


there are x Ak states in range Ak. All of this applies also to states 


with the cosine wave function, so that we may replace sums by inte- 
grals in our formulas with 


Dt eM Maye: (4-69) 


n= 


and remember to add the result for the two kinds of wave functions, 
namely, V2/L cos kz and V2L sin kz. 
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It is often inconvenient to use sin kx and cos kz for the wave func- 
tions as we would like to use the linear combinations 


ez = cos kz + isin kx 


and e-#** = cos kz — isin kz. We were forced by our box to use sines 
and cosines and not the linear combination, because for a given k one, 
but not both, of the functions is a solution. But if we can disregard 
small errors arising from these small differences in k, we might still 
expect to be able to get the correct results from these new linear 
combinations. Normalized, they are 


1 ikr 1 —ikz 
A 76 and aE 6 


Since the wave e #** can be thought of as e** but for negative values 
of k, our new procedure, including the addition of the two kinds of 
wave functions, becomes the following practical rule: 

To deal with free-particle wave functions e***, normalize them to a 
range of x of length L (i.e., use ف‎ = ~/1/L e***), and replace sums over 
states by integrals on k with the rule that the number of states with 
k in the range k to k + dk is L dk/2r and the range of kis — œ to +. 


Periodic Boundary Conditions. Sometimes this excursion into 
cosines and sines and back to exponentials is avoided by the following 
argument. The wall is artificial anyway, so its particular position and 
the particular boundary condition should not make any physical dif- 
ference as long as it is far away. So instead of the physically simple 
conditions ¢ = 0 at z = L/2 and at x = —L/2, let us use two others 
for which the solutions are indeed e*? directly. These are 


$(2) (a r= 3) = 4(2) (at me =) (4-70) 
and 
م‎ (x) 6 C= 5) = o' (x) 6 c= =>) (4-71) 


These are called periodic boundary conditions, because the same ones 
would result by the requirement that ¢ is periodic in x in all space with 
period x = L. It is readily verified that the functions +/1/L e** are 
solutions, normalized to range L, provided k = 2rn/L with n an integer, 
positive, negative, or zero. From this our rule follows directly. 

In three dimensions we can see what happens by using a rectangular 
box of sides Lz, Ly, L, in the three directions. Let us use periodic 
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boundary conditions. That is, the magnitude and first derivative of 
a wave function at a point on one face are respectively equal to the 
magnitude and first derivative at the corresponding point on the oppo- 
site face. The wave function for a free particle normalized is 


gi tk, 4 ik, 1 ik,z — 1 iker 
NZ ° NE Vie ve (4-72) 


where V = L,L,L, is the volume of the box and the allowed values of 
kz are 2rnz/L, for n, an integer, those of k, are 2rn,/Ly, for n, an integer, 
and those of k, are 2rn./L,for n, an integer. Furthermore, the number 
of solutions with k, in range dk,, ky in dk,, and k, in dk, is 

dk dk dk ak 


EE افد‎ wet pee ١ 
27 Le 27 L, 27 L; (2r)? us ce 


: ; Fy 
That is, use plane waves normalized to volume V: 0 exp (ik ° r). 


The number of states in range d*k (differential volume of k space) is 
17 dk/ (2r). 

Let us apply this to Prob. 4-11 and recall the connection between 
momentum and wave number p = fk brought out in Sec. 3-1. In Eq. 
(4-64) we must make two changes. First, since the wave functions 
used were exp (ip *r/f), whereas we should have used 


1 م20‎ 7 
Jre 7 


there should be an additional factor 1/V; for the product of two wave 
functions was involved. Second, the symbol D ( ) must be replaced 


p 
by Vf( ) .27+2(3)/م02‎ This justifies what was done in Sec. 4-2 and 
the results given in Prob. 4-11. 
It is noted that the V factors cancel out, as indeed they must; for 
as V — œ the kernel K must be independent of the size of the box. 


Some Remarks on Mathematical Rigor. The reader may have 
one or two reactions on seeing how the volumes V cancel at the end 
of this calculation. One might be: How nicely it cancels out as it 
should, for the walls have no effect. The other might be: Why do 
it in this complicated and “dirty” nonrigorous manner, putting in 
walls which make no difference, etc., when all this can be done much 
more elegantly and rigorously mathematically without the need of 
walls, etc? It depends whether you are physically minded or mathe- 
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matically minded. There are many misunderstandings between 
mathematicians and physicists on the place of mathematical rigor in 
physics, so perhaps a word as to the value of each method (the box 
or mathematical rigor) may be in order. 

There is, of course, the more trivial point: Which is most familiar— 
which takes the least new knowledge? Most physicists have seen this 
argument about how to count the states in a box before. 

Another point is that the mathematically rigorous solution may not 
be physically rigorous. That is, the box may in fact exist. It may 
not be a rectangular box, but it is not often that experiments are 
done under the stars. Rather they are done in a room. Although 
it is physically reasonable that the walls have no effect, it is true 
that the original problem is set up as an idealization. It is no more 
satisfactory idealization to move the walls to infinity than to replace 
them by perfect mirrors far away. The mathematical rigor is wasted 
in the first idealization, since the walls are not at infinity. 

The wall approach is just as rigorous, or rather just as nonrigorous. 
It has several advantages. For example, in finding that the volume 
cancels out we do learn that at least one aspect of the idealized walls, 
namely how far away they are, is unimportant. This discovery makes 
us more intuitively convinced that the actual disposition of the real 
environment may be unimportant. Finally, the formula derived is 
very useful when in fact we do have a finite sample. For example, in 
Chap. 8 we shall use it to count sound-wave modes in a large, rec- 
tangular block of material. 

On the other hand, the advantage of the mathematically clean argu- 
ment is the avoidance of much unnecessary detail that cancels out. 
Although, using the wall approach, one may learn something about 
how the walls have no effect, one may be firmly convinced that this 
is true anyway and not wish to descend into details to see it again. 

The normalization problem is a special example, but it illustrates 
the point. The physicist cannot understand the mathematician’s care 
in solving an idealized physical problem. The physicist knows the 
real problem is much more complicated. It has already been simpli- 
fied by intuition, which discards the unimportant and often approxi- 
mates the remainder. 
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Measurements 
and Operators 


So far we have described quantum-mechanical systems as if we intended 
to measure only the coordinates of position and time. Indeed, all 
measurements of quantum-mechanical systems could be made to reduce 
eventually to position and time measurements (e.g., the position of a 
needle on a meter or the time of flight of a particle). Because of this 
possibility a theory formulated in terms of position measurements is 
complete enough in principle to describe all phenomena. Neverthe- 
less, it is convenient to try to answer directly a question involving, say, 
a measurement of momentum without insisting that the ultimate 
recording of the equipment must be a position measurement and with- 
out having to analyze in detail that part of the apparatus which con- 
verts momentum to a recorded position. Thus, in this chapter, instead 
of concentrating on the amplitude that a particle has a definite posi- 
tion, we shall develop the idea of an amplitude to find a definite 
momentum, energy, or other physical quantity. 

In the first section of the chapter we shall show how a system may 
be described in terms of momentum and energy. The concepts learned 
here will be extended in the second section to describe in general various 
ways of representing the quantum-mechanical system. The transfor- 
mation functions which enable us to go from one method of represen- 
tation to another have many interesting properties. Among them is 
the concept of an operator, which was introduced in the preceding 
chapter and will be discussed further in the third section of this chapter. 


THE MOMENTUM REPRESENTATION 


The Momentum Amplitude. So far we have used the concept 
of probability in terms of the position of a particle, but suppose we 
wish to measure the momentum. Is there an amplitude (p) whose 
absolute square will give us the probability P(p) that a measurement 
of momentum will show that the particle has the momentum p? 
There is in fact such an amplitude, and we can easily find it. 

Some ways of measuring momentum (or other physical quantities) 
correspond to measurements of position, and thus they can be analyzed 
if we know how to analyze coordinate measurements. For example, 
working in one dimension, suppose we have a particle whose position 
at t = 0 is localized within +b of the origin of the ند‎ axis. The uncer- 
tainty b can be as large as desired so long as it is finite. We can 
measure the momentum of such a particle by a time-of-flight technique. 
That is, we can observe how far the particle has traveled (assuming 
no forces) by the time t = T. If the position is x, then the velocity 
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is x/T and the momentum is p = mz/T. The error in such a momen- 
tum measurement, +mb/T, can be made as small as desired by making 
T sufficiently large. 

Suppose we analyze the momentum probability P(p) as defined by 
such an experiment. The probability P(p) dp that the momentum 
lies between p and p + dp is the probability P(x) dz that, if all the 
potentials affecting the particle are suddenly turned off, then after 
the time T the particle will be found between the points x and x + dz. 
Of course, this requires that we connect م‎ with x by p = mz/T. 
Assume the wave function of the particle as given by f(y) at t = 0, 
and our problem is to find P(p) directly in terms of S(y). 

The amplitude for the particle to arrive at x at the time t = T is 


ylz, T) = f, KoE, T 4,0 fu) dy (5-1) 


Upon substitution for the free-particle kernel Ko, this expression 
becomes 


5 m IMT? f% —imxy 
VET) = N rinn FP DAT ibe 6 AT ) 


6 aT) fly) dy (62) 


The absolute square of this amplitude gives the probability that 
the particle lies between x and x + dz. According to our definition, 
this is identical (in the limit as T— œ) with the probability that the 
momentum of the particle lies between p and p + dp. 


sour |. {ex | sep ot - 200 || ro ay | 
= P(p) dp asT— مه‎ (5-3) 


Then substituting p = mz/T, and supposing that we pass to the limit 
of large T, there results 


J = | exp Ga 5 الا‎ | Sty) dy | (5-4) 


We assumed earlier that, initially, the particle would be restricted 
to a region within +b of the origin. This means that the initial wave 
function f(y) drops to 0 for values of y larger in absolute magnitude 
than b. Now as T becomes large the quantity imb?/2hT becomes 
negligibly small. Since there is no contribution to the integral of Eq. 


0 
P(p) dp = 55; 
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Fig. 5-1 The amplitude for particles traveling freely to arrive at the position z in the ti..1e 
interval T is determined by the product of two functions. ‘The first is the amplitude f(y) 
for the particle to start at some position y, as shown by the shaded curve in the figure. 
The second, the amplitude to get from y to z, is the free-particle kernel K(zx,7';y,0), as 
shown by the sine wave of slowly changing wavelength in the figure. (Here we treat the 
final position z as the origin of this function, since y is our variable.) If the point z is far 
from the origin, compared to the distance —b to +b over which f(y) is nonzero, the wave 
has an approximately constant wavelength. Its approximate form is exp [(—7/h) (mz/T)y]. 
The two functions are multiplied together and then integrated over y to get the final 
amplitude for arrival. Since all particles have traveled approximately the same distance z 
(again assuming z > b) in the same time T, this final amplitude is equivalent to the 
amplitude that the particles have the momentum p = mz/T. 


(5-4) for values of y greater in absolute magnitude than b, the proba- 
bility P(p) dp approaches dp/2rh times the absolute square of the 
amplitude} 


oo) = f° (exp EE) su) dy (5-5) 


An alternative explanation of this result is given in Fig. 5-1 and 
extended in Fig. 5-2. 

The expression for the momentum amplitude given by Eq. (5-5) 
applies to a one-dimensional situation. It is easy to extend the defi- 


1 Many writers prefer to account for the factor 1/277 in the definition of $(p), 
where it appears as 1/./2rh. However, following the development of Sec. 4-3, 
we prefer to write it in the form we have used and remember that the differential 
element of momentum always includes the factor 1/27 in each dimension. For 
example, the differential element of momentum in three-dimensional momentum 
space is d3p/(2rh)3. 
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Fig. 5-2 If the amplitude f(y) is 
roughly periodic with the same wave- 
length as the overlying kernel, as 
shown in (a), then the integral of the 
product of the two functions is very 
large. That is, the probability that 
the momentum is p = mz/T is large. 

On the other hand, suppose the 
wavelengths differ for some new func- 
tion f’(y) as shown in (b). Then, 
when the product is taken, the con- 
tributions to the integral from dif- 
ferent values of y will cancel each 
other. Now the probability that the 
momentum is mz/T is small. 

If a new position v’ is chosen as a 
final point, as shown in (c), then a new 
region of the kernel curve overlies the 
space —b to +b. For a correct 
choice of z’, the wavelength of the 
kernel in this new region is the same as 
the wavelength of f’(y) and a large 
probability results. That is, there is 
a large probability that such particles 
have the new momentum value 
p = ma! /T. 


| l 
-4!4 Position, y y=x’ 


(c) 
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nition to the three-dimensional case where the amplitude for the 
momentum is 


60) = f {exp | = f o-2 || هار‎ arr (5-6) 


where the wave function f(r) is now assumed to be defined for all 
points in the three-dimensional coordinate space. This is the ampli- 
tude that the particle has the momentum p at the time t = 0. (Note 
that it is not defined for the time t = T. The time interval T is part 
of the measuring equipment, and it can be varied without changing 
the momentum amplitude.) The square of this amplitude, multiplied 
by the differential momentum element, gives the probability of finding 
the momentum in the interval (three-dimensional) d*p/(2rh)* of 
momentum space. 

We have analyzed a momentum measurement which is based on a 
time-of-flight technique. However, such an analysis can be applied 
to other techniques. The analysis of any technique for measuring 
momentum will give the same result for the momentum amplitude. 
For suppose we have two methods or techniques which purport to 
measure the same quantity, momentum. If one gives a different result 
than the other, we have to explain why one or the other apparatus is 
faulty. So if you will grant that the time-of-flight technique is an 
adequate way to define a momentum measurement, any other piece 
of equipment which measures momentum must give the same results 
P(p) dp for the distribution of momenta if the system is in the state 
f(y). Analysis of any equipment which measures momentum must 
give the same expression (p) for the amplitude for momentum p, 
within possibly an irrelevant constant phase difference (i.e., a factor 
e? with ô constant). For example, consider the following problem. 


Problem 5-1 Consider any piece of experimental equipment 
designed to measure momentum by means of a classical approximation, 
such as a magnetic field analyzer. Analyze the equipment by the 
methods outlined in the preceding paragraphs. Show that the same 
result for the momentum amplitude is obtained. 


Transformation to Momentum Representation. We have 
called ¥(R,t) the amplitude for a particle to be at the point R at the 
time ¢. We have found that the momentum amplitude is given by 


60,0 = f ® [exp | = f 0-0 || هت قيهن‎ (62) 
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We shall call this the amplitude that the particle has momentum p at 
the time t. 

It is often useful to analyze problems in this momentum representa- 
tion rather than in the coordinate representation, or, as it is often 
stated, in momentum space rather than in coordinate space. Actu- 
ally, the transformation from one representation to the other is just 
a Fourier ‘transform. Thus if we have the momentum representation 
and wish to find the coordinate representation, we use the inverse 
transform given by 


oP, (5-8)‏ ممه | ١م‏ رمك | f”‏ = مكار 


We can describe this last formula in the same physical terms we 
have used to describe the structure of other amplitudes. The ampli- 
tude that the particle is at the position R is given by the sum over 
alternatives. In this case the alternatives correspond to the products 
of two terms. One of these is the amplitude that the momentum of 
the particle is p, given by (p). The other term, exp (ip- R/h), is 
the amplitude that if the momentum is p, then the particle is at the posi- 
tion R. This second factor is not new to us, for we have discussed 
such an expression in Prob. 3-4. 

Note that in the transform of Eq. (5-7) the exponent has a minus 
sign. Such a term can be described in a manner parallel to that used 
in the preceding paragraph. Thus we can say that exp (—ip ٠ R/h) 
is the amplitude that if a particle is at position R, it has the momentum p. 


The Kernel in Momentum Representation. We have shown 
(Sec. 3-4) how a wave function at a particular time t can be derived 
from the wave function at an earlier time tı with the help of the kernel 
describing the motion of the particle in the intervening time. Thus 


(Rat) =f." [7 K(Rsta;RytDY(Ruyt1) dR, di (5-9) 


It is possible to define a kernel in momentum space which would be 
used in a parallel expression. Thus the momentum amplitude at the 
time t can be derived from the momentum amplitude at an earlier 
time tı by 


2 1 0 
(pat) = fi |" K(pataiDit) (ارده)ة‎ Gr Ah (5-10) 


Substituting in Eq. (5-9) for ¥(Ri,t:) the expression of Eq. (5-8) 
and taking the Fourier transform of ¥(Rz,t2) to gel $(po,te), as in Eq. 
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f axis 


Fig. 5-3 The kernel for a free 
particle in momentum space is 
unlike the kernel in coordinate 
space. In momentum space, 
there is only one path which can 
carry the particle to the momen- 
tum value وم‎ at the time təz. 
That single path must start at 
the momentum pi = .وم‎ No 
other paths contribute to the 
kernel. 
P axis 


(5-7), we see that the kernel in momentum representation is given in 
terms of the kernel in coordinate representation by the expression 


Ri fR: 1 i 
K (P2,t2;P1,t1) = | / eUD RK (Ro,l2jR 1,t1)e FDPIR! 
dR, dR (5-11) 


For example, the kernel describing the motion of a free particle in 
momentum space is 


Ri R: 5 
Ko(P2,to;pa,ts) = ff جلمسبمهرفس‎ (Rete Riyhre™™ QR, dR, 
١ 2 
= (2rh)*53(pi a P2) exp E ilp:| (te -== n) | for to > ty 


2hm 
= 0 for t2 > ty (5-12) 


[The last line follows the convention of Eq. (4-28).] The occurrence 
of the delta function in this expression shows that the momentum of 
a free particle does not change, as diagramed in Fig. 5-3. However, 
the phase of the momentum wave function changes continuously in 
accordance with the factor ei*‘/*, where E = p?/2m. This result 
given by Eq. (5-12) can also be seen directly from Eq. (4-64). 

This latter kernel offers a much simpler representation of the free 
particle than does the kernel in coordinate space. Generally, when 
the particle is not free, but rather moves under the influence of a 
potential, the kernel in momentum representation loses its simplicity. 
But if the effect of the potential can be represented in a perturbation 

, expansion, this simplicity is regained (Chap. 6). 


The Energy-Time Transformation. For many applications, 
particularly in relativistic quantum mechanics, it is best to treat the 
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variables of space and time in a symmetric manner. Then in trans- 
forming from coordinate representation to momentum representation 
we include a transformation from time to energy. ‘Thus the complete 
transformation for a kernel is 


Ri /R: fo 0 
k(p2,E2;p1, E1) = Í 1 (ee Í. eG وج‎ Rao IM Fits (Ro la ;R 1,1) 
١ ell حو نر‎ lilh) By ty dR, 031 011 dtz (5-13) 


The energy EB is not equal to p?/2m, but is instead an extra independent 
variable (the coefficient of time) needed to define the kernel. Only if 
the system exists in the same energy state for an infinite time can an 
exact measurement of E be made to establish the relation between 
energy and momentum. 

As an example, we shall work out the kernel for a free particle. For 
this case the integrals over R; and R: have already been worked out, 
with the results given in Eq. (5-12). Thus we are left with the inte- 
grals over tı and t. Make the substitution tz = tı +7. Then the 
double integral can be written as 


i} pe: eGA (E —E)t dt, ih elI) (Zy—p"/2m)r qy (5-14) 


The first of these two integrals is a representation of the Dirac ô 
function. In particular it is 27%6(Z, = E1). The second integral is 
of the form 


f eier dr (5-15) 


This latter integral arises often in quantum-mechanical problems. If 
w is a real number, the integral does not converge. In order to carry 
out the present calculation, we shall replace w with a complex number 
w + îe. When both w and e are real numbers, the integral has the 
value 1/(w + ie). 

Now it would be possible to take the limit of this fraction as e 
approaches 0 and interpret the result simply as i/w. However, such 
an interpretation would lead to incorrect (or rather, incomplete) results 
in further work. The function we are evaluating is a kernel, and in 
future work it will often be integrated (multiplied by some other 
function) over values of w or its equivalent. If e were dropped from 
the expression, then such integrals would have a pole at w = 0, and 
we would be at a loss what to do. 

It would not be correct to take just the principal part of the integral 
at such a pole. This would give the wrong result. In particular, such 
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a result would imply that the inverse transform of the kernel would 
not give back the original coordinate representation kernel with which 
we started. Such a transform would differ from the correct kernel 
in that it would not be zero for values of time less than zero. One 
way to obtain the correct result from such integrals is to place the 
pole an infinitesimal distance above the real axis. This is accom- 
plished by leaving e in the expression. 
If we rationalize the expression as 
(5-16) 


€ 
otie ote EE wane 
we can interpret the first term on the right-hand side as t/w and in 
further integrations use the principal part of an integral involving 
this term. The second term becomes rê(w) as e approaches 0, and it 
is to be interpreted as such in further integrations. That is, if a more 
precise mathematical definition is wanted, ¢/(w + ie) should be replaced 
by P.P.(i/w) + 76(w). This means that 


o0 A F 
f et dr = lim : 
0 e0 W + 6 


BE (0 o (5-17) 


In all expressions containing e, a limit at e— 0 is implied. 
Returning to the evaluation of the kernel, we replace w with 
E, — p?/2m to find 


2 
ko(p2,H23p1, E1) = (2rh)*5° (pe 2 pı) ô(E? = Eji 6 — 5- + ie) 
(5-18) 


The existence of the delta functions in this expression means that 
neither the energy HE nor the momentum م‎ changes during the motion 
of a free particle. These two quantities affect the motion of the par- 
ticle as shown by the last term of this equation. That is, the ampli- 
tude for the motion from one point to another of a free particle with 
energy E and momentum p is proportional to (E — p?/2m + ie). 

Earlier in this section it was mentioned that the energy E is not 
in general identical to p?/2m, but is instead a separate variable. To 
understand the distinction, let us look at the kernel for a free particle, 
which is a wave-like function in time and space and wherein E is the 
coefficient of time and thus has the properties of a frequency. This 
kernel, given in Eq. (5-12), has the form shown in Fig. 5-4 when plotted 
against the time difference T = tz — tı. 
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Fig. 5-4 The real part of the free- 
particle kernel Ko plotted vs. time. 
The function is zero for negative 

7 times, then starts with a sharp 
jump at t = 0 and continues as a 
cosine wave of constant amplitude 
and frequency. 


K is zero for T less than zero, and it suddenly begins to oscillate 
at T = 0. The transformation from time to energy representation is 
equivalent to a Fourier transformation. Since the wave has a sharp 
beginning (at T = 0), the Fourier transform contains components at 
all frequencies and thus at all energies. If the function extends over 
a long time interval (many periods), then one frequency begins to 
dominate in the Fourier transform. For the free particle this domi- 
nating frequency corresponds to the energy Eo = p?/2m. 

It is for this reason that the free-particle kernel contains the factor 


SEAT IETS ea acts 
Ey? — py?/2m + te د‎ 86 5 7 trò 6 z) (5-19) 


Here the first term on the right accounts for the transient effects that 
result from the sudden start at t = 0. The second term gives the 
steady-state behavior and shows that, if we wait long enough, the only 
energy found is the usual p?/2m; but near t = 0 the energy is not given 
by this classical formula. 


Problem 5-2 If we transform only the time and not the spatial 
variables, defining 


k(z2,B: 301,H1) = J fe MEK (x2,t2 Tpi) OLE dtz dt; (5-20) 
show that for a system with a time-independent hamiltonian H 


* 
k(t, Bait, E) = 2rhið(E: - Es) Y poner (5-21) 


where m and Em are the eigenfunctions and eigenvalues of H. 
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5-2 MEASUREMENT OF QUANTUM - MECHANICAL VARIABLES 


The Characteristic Function. In the preceding section we have 
shown how an experiment designed to measure momentum leads to a 
definition of a probability distribution for the momentum. That is, 
from the results of a correctly designed experiment we can answer the 
question: What is the probability that the momentum of a particle is p? 
From the existence of a probability function for momentum we were 
led to the discovery of a wave function or amplitude written in terms 
of momentum variables. In fact, we found that a system could be 
completely described and problems completely analyzed in a momen- 
tum-energy representation as well as the space-time representation 
which we have used heretofore. 

These same results apply to physical variables other than momentum. 
If any physical quantity can be measured experimentally, a probability 
function can be associated with it. That is, if an experiment is capable 
of measuring some characteristic A associated with a system (e.g., the 
x component of momentum), then after repeating the experiment sev- 
eral times it will be possible to construct the probability function P(a) 
which gives the probability that in any particular experiment the 
numerical values of A will be found to be equal to a. 

In general, it is possible to associate a probability amplitude with 
such a probability function. This amplitude would be defined in 
terms of the measured variable, together with other variables neces- 
sary to complete the specification. Let us see what is involved by 
generalizing our example of a momentum measurement. First we 
shall take just one dimension, but the extension to several dimensions 
will be obvious. We ask: Does the system have the property G? For 
example, G might stand for the statement: The value of the quan- 
tity A is equal toa. We must have some way to answer this experi- 
mentally. So let us imagine some equipment can be designed so that, 
if it has the property G, the particle will pass through the equipment 
and arrive at a certain location on some screen or meter. 

The probability of this may be written 


PG@) = |JKezp(f,2)f(x) dal? (5-22) 


if f(z) is the wave function of the system to be measured, K.2p(¢,x) is 
the kernel for going through the particular experimental apparatus, 
and ¢ is the position of arrival for particles with the property G. This 
probability has the alternative mathematical form 


P@) = |Jg*(x)f(@) da|? (5-23) 
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where we have put 
g* (x) = Kesp(f,2) (5-24) 


(Defining this as the complex conjugate of a function is just for con- 
venience, as we shall see later.) So we can say 


Y(G) = Sg*(x)f(x) dx (5-25) 


is the amplitude that the system has the property G. This concept 
is further described in Fig. 5-5. 

The property is defined by the function g*(x) for the following 
reason. Suppose some other experiment with different equipment, and 
hence a different kernel K.2»(n,x), should be built to measure the same 
property. In this second experiment the particle arrives at n. Then 
the probability of finding that the system has the property G is 


[Keep(n,t)f(x) “نمك‎ or | fg"*(x) f(a) dex? (5-26) 


Since the property measured is the same, we must obtain the same 
result in every case for P(@) as we did with the previous experiment. 
That is to say, we must have 


lfg'*(x)f(a) dz|* = |f9*(x)f(a) dzļ? (5-27) 


for any arbitrary function f(x). This means g*(z) = g’*(x) within 
at least an unimportant constant phase factor e®, That is, all methods 


mm‏ د 


Fig. 5-5 A device designed to 
measure the property © is placed 
between the initial point of the 
incoming particle [with wave func- 
tion f(x)] and the final point x = ¢. 
The equipment modifies the kernel 
for the motion (compare Figs. 5-1 
and 5-2), making it equal to g(z). 
The product f(x)g(x), integrated 
over x, is the amplitude to arrive 
at ¢ after passing through the 
equipment. 
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to determine the same property correspond (within a phase) to the 
same g*(x). For this reason we call g*(x) the characteristic function 
of the property G. 

We may ask another question. What must the state f(x) be so 
that it is sure to have the property G? (For example, what is the 
wave function for a particle whose momentum is definite?) That is, 
we wish to find an f(x), say, F(x), so that the particle going through 
the apparatus will certainly arrive at ¢ and at no other point ¢’. The 
amplitude to arrive at ¢’ should be proportional to 40 — ¢) (that is, 
zero unless ¢’ = ¢). Hence 


JKeap(t,z)F (x) dx = (5 — ¢) (5-28) 


This we can solve by the relation of the complex conjugate of a kernel 
to its inverse, discussed in Sec. 4-1. We have from Eq. (4-37) 


KEDE (S,2) dx = 5(¢ — ¢’) (5-29) 
so that 
F(x) = = KAGET x) = g(x) (5-30) 


That is, g(x) is the wave function of a particle having the property G 
with certainty. We can say either (1) the particle has the property 
G or (2) the particle is in the state g(x). So we find: If a particle is 
in a state f(x), the amplitude it will be found in a state g(x) is 


¥G@) = Jg*(@)f(x) dz = [()واة‎ (5-31) 


For more dimensions, x becomes a space of several variables. 

We might say loosely: The probability that the particle is in the 
state g(x) is |fg*(x)f(x) dz|?. This is all right if we know what we 
mean. The system is in state f(x), so it is not in g(x); but if a meas- 
urement is made to ask if it is also in g(x), the answer will be affirma- 
tive with the probability 


P@ = | | Oa) de} = l=) (5-32) 


A measurement which asks: Is the state g(x)? will always have the 
answer yes if the wave function actually is g(x). For all other wave 
functions, repetition of the experiment will result in yes some fraction 
P (between 0 and 1) of the tries. This is a central result for the 
probabilistic interpretation of the theory of quantum mechanics. 

For all of this we deduce an interesting inverse relationship between 
a wave function and its complex conjugate. In accordance with the 
interpretation of Eq. (5-25), g*(x) is the amplitude that if a system 
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is in position z, then it has the property G. [Such a statement is put 
mathematically by substituting a ô function for f(x) in Eq. (5-31),] 
On the other hand, g(x) is the amplitude that if the system has the 
property G, it is in position z. (This is just a way of giving the defi- 
nition of a wave function.) One function gives the amplitude for: 
If A, then B. The other function gives the amplitude for: If B, then A. 
The inversion is accomplished simply by taking the complex conjugate. 

Equation (5-31) can be interpreted as follows: The amplitude that 
a system has property © is (1) the amplitude f(x) that it is at x times 
(2) the amplitude g*(z) that if it is at z, it has property G, with this 
product summed over the alternatives z. 


Problem 5-3 Assume if ee f*(x)f(x) dx, which is the probability 


that a particle of wave function f(z) is somewhere, has been normalized 
to the value 1. Under this constraint, show that the state f(x) which 
has the highest probability of having the property © is f(x) = g(x). 


Problem 5-4 Suppose the wave function for a system is (a1) at 
time tı. Suppose further that the behavior of the system is described 
by the kernel K(x2,to;x1,t:) for motions in the interval t, > t > ty. 
Show that the probability that the system is found to be in the state 
x(x) at time tz is given by the square of the integral 


le 0 x (ونه)‎ K (z£2,t2;£1,t1)Y (£1) 02:1 dae 


We call this integral the transition amplitude to go from state y(x) to 
state x(x). 


Measurements of Several Variables. In the considerations of 
the preceding section we assumed an ideal experiment, which means 
that no quantity besides A could be measured at the same time. That 
is, we do not allow that more than one g(x) would give the same result, 
but assert that the maximum possible amount of information has been 
obtained from the system by a measurement of A. 

Now in reality it is common for several variables to determine 
the state of a system. For example, if only the x component of 
momentum is measured in a three-dimensional system, no definite 
g(x) can be defined. Both the wave functions exp (ip:r/h) and 
exp (tp.x/h — ip,y/h) give the same value pz for the x component of 
momentum. So if only pz is measured in a three-dimensional system, 
the particle could be moving with any component of momentum in the 
y direction and not change the outcome of the measurement. Nor 
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need the particle come to some unique point in the measuring apparatus. 
All the particles which arrived at some line or set of points could have 
the same value for pz. 

Thus in general, we see that the wave function g(x) defines the prop- 
erty G as follows: A state described by the wave function g(z) is certain 
to have the property G. However, the converse is not necessarily true. 
That is, it is not certain that all states having the property © are 
described by the wave function g(x). Only if G includes a specifica- 
tion of all the quantities that may be simultaneously measured is the 
wave function completely defined by @. Even then there remains an 
undefined (and unimportant) constant phase factor eû. 

It is easy to make the necessary extension of the characteristic func- 
tion g*(x) when the ideal experiment requires the measurement of more 
than one variable. Thus suppose we have a set of quantities which 
we shall call A, B, C, . . . , and which can all be simultaneously meas- 
ured in an experiment, for example, the z component of momentum, 
the y component of momentum, etc. Suppose we can completely 
describe the state of a system by specifying the numerical values 
a, b, c, . . . assigned to these quantities. That is, we completely 
describe the state by saying whether or not it has a certain property. 
In this case the property in question is that the value of A is a, the 
value of B is b, ete. Furthermore, suppose that no additional infor- 
mation (information not derivable from a knowledge of the numerical 
values of A, B, etc.) could be obtained simultaneously by any means. 

Imagine we have an experimental setup capable of measuring all 
these quantities, i.e., capable of telling us whether or not the state 
has the property that the value of A is a, etc. We shall call the 
characteristic function of such a property 


G2) = )...ير‎ (5-33) 


This function is, of course, a function of the numerical values a, b, 
c, . . . which the experiment is set up to measure, as well as the 
coordinate variable 7. 

Suppose the system is in the state f(x). Then the probability that 
the experiment would show that the value of A is a, the value of B 
is b, etc. (i.e., the probability that the state has the property in ques- 
tion), is 


P(a,b,c, . . .( = |Jxéin,c,...(x)f(x) da? (5-34) 


Transformation Functions. Suppose the system is actually in 
the state ..,,مم مج‎ (£), that is, the value of A is a’ , etc. Then with 
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our experiment the probability of finding the system in a state described 
by a, b, c,... is zero unless a =a’, b = DG kom E This 
means inet with suitable normalizing fator we have: 


‘pe XE (Dx... (®) dx = (a — a’)5(b — b’)d(c — c’) (5-35) 


The function Xa»,e,...(«) is the amplitude that if the system is in the 
state described by a,b,c, . . . , then it will be found at x. The func- 
tion x24,¢,...(2), whieh we have called the characteristic function, is 
the amplitude that, if the system is at x, it will be found in the state 
specified by a, b, c, : 

If the system = cowl to be in the state f(z), then 


Pose... = [, O (ef) dx (5-36) 


is the amplitude to find the system in the state specified by A having 
the value a, B having the value b, ete. 

The quantities F.,»,«,... are just as good a representation of the state 
as the function f(z,y,z,...). In fact, if we know the function 
Fa,,e,..., We can reproduce the function f(x,y,z, . . .) by means of an 
inverse transformation. 

The function F'..,c,...is called the A, B,C, . . . representation of the 
state. (In the preceding section we had an example of this in the 
momentum representation.) The function f(x,y,z, . . .) is the custom- 
ary coordinate representation, Or 7, Y, 2, . . . representation, of thestate. 
Transformations between the two are carried out with the help of the 
functions x and x*. In particular, the function x*,,.. (x,y,z, . . .( 
is the transformation function necessary to go from the v, y, 2, . . . 
representation to the A, B, C, . . . representation, while the func- 
tion Xa,b,c,...(£,Y,z, . . .) is the transformation function necessary to 
go the other direction. Thus the inverse of the transformation given 
by Eq. (5-36) is 


f(z,y,2, sare a) = DW g ee Xa EAA, ا‎ 3) (5-37) 


This says that the amplitude to be found at g is the amplitude F,,,... 
to be found with A = a, B = b, . . . times the amplitude xo,5,c,...(z) 
to beat z if A = a, B = b, etc., summed over alternatives a, b, c, 


Problem 5-5 Assume that the function f(x,y,z, . . .) can be repre- 
sented by 


E. > pyre Ge ICA eee) (5-38) 


Quantum mechanics and path integrals 


By substituting this relation into Eq. (5-36), and using the orthogonal 
properties of x as defined by Eq. (5-35), show that ...عرق ل = ...و1‎ 


Problem 5-6 Suppose A, B, and C are the three cartesian com- 
ponents of momentum Pz, Py, pz. What is the form of the function 
Xa,b,c(v,y,2)? Using the results of Sec. 5-2, verify the relations obtained 
in Sec. 5-1. 


Problem 5-7 Suppose that the A, B,C, . . . representation does 
not correspond to either coordinate representation or momentum 
representation, but instead is some third way of representing the state 
of the system. Suppose we know the function xq,,c,...(,Y,2, - . .( 
which permits us to transform back and forth between coordinate 
representation and A, B, C, . . . representation. Suppose further 
that we know the transformation function necessary to transform back 
and forth between coordinate representation and momentum repre- 
sentation. What then is the function necessary for the transformation 
between momentum representation and A, B,C, . . . representation? 


OPERATORS 


Expected Values. We can develop a few further properties of 
these transformation functions. Let us try to answer this question. 
A system is in a state specified by the wave function f(x), and the 
quantity A is measured. If the measurement is repeated many times, 
what is the average value which will be obtained for A? We shall 
denote this average value (sometimes called expected value) by the 


symbol (A). 
Suppose it is possible, in principle, to measure simultaneously 
several physical quantities A, B, C, . . . , where a measurement of A 


could produce any one of a continuous or discrete set of values a, a 
measurement of B could produce any one of a continuous or discrete 
set of values b, etc. The probability of obtaining one particular set 


of values a,b,c, . . . is |Fas.c,.../2. So the probability of obtaining a 
particular value a in a measurement of A, irrespective of the values 
taken on by B, C, . . . (for example, if B,C, . . . were not measured 
at all) is 

P=), noe EE (5-39) 


In this equation summations are carried out over all possible values 
in the continuous or discrete sets of b, c, . 
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The average, or expected, value resulting from a measurement of A 
is obtained by multiplying the probability of Eq. (5-89) by a and 
summing the result over all possible values of a. Thus 


(A) = OD FR (5-40) 


The need for computing such expected values arises frequently in 
quantum-mechanical problems. It is useful to have available formulas 
which simplify such computations. This subject, the subject of oper- 
ators, was discussed briefly in Sec. 4-1. Now we shall develop a few 
additional results. However, nowhere in this book shall we attempt a 
really thorough study of operator calculus, since several excellent 
works along this line are already available.! 


The Operator. Let us try to express the expected value of A 
directly in terms of the original wave function f(x). Note first that 
the absolute square of Fa»,e,... can be written as 


As È = RE Sr erie (5-41) 


Then, using Eq. (5-36), we can write 


ee) Peal Reon oe) dada Ciera 
aD te 
= | °, ORC) dz (5-42) 
In the second line of this equation we have made use of the substitution 
Ra) = | °, Gale,2" fle’) de’ (5-43) 
where we have written 
Galen") = YY ° ٠ ° axate.. xan... @) (5-44) 


Equation (5-43) says that the function R(x) results from the func- 
tion f(x) as the result of an integration performed with the help of a 
linear integral operator Ga(a,x’) associated with the quantity A. 


1 For example, see Dirac, ‘Principles of Quantum Mechanics,” Clarendon Press, 
Oxford, 1947. 
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Often an equation like Eq. (5-43) is symbolized by the notation 
ا‎ > (5-45) 


where @ stands for a linear operator which operates on the function f. 
In the present case @ stands for the operation displayed on the right- 
hand side of Eq. (5-43), that is, multiplication by the function G4 and 
integration. The operator @ is associated with the physical quantity 
A. Using this notation, we can write 


(A) = ۳ J (a) @f(x) dx = T Ja f*(x)Ga(z,x")f(a') dx’ dx (5-46) 


Problem 5-8 Note that Eq. (5-44) implies G3(z,2’) = Ga(z’,z). 
With this in mind show that for any two wave functions g(x) and f(z), 
both of which approach 0 as x goes to œ, 


|, Oaa) dx = COOL (5-47) 


Any operator, such as @, for which Eq. (5-47) holds is called hermitian 
[cf. Eq. (4-30)]. 


Problem 5-9 The transformation function between space repre- 
sentation and momentum representation is 


Xeo,b,c(r) = e/a pr (5-48) 


(see Prob. 5-6). Choose the physical quantity A as the momentum 
pz in the ند‎ direction. Show that the function Ga is 


Gp, (a) =" (æ — a')aly — èle — 2” (5-49) 


d ; : ; 
where ô'(x) = EE: d(x). With this result determine the operator cor- 


responding to the x component of momentum and show that the 
expected value of this component of momentum can be written as 


f°, PO a (5-50)‏ = وما 


Problem 5-10 Suppose the quantity A corresponds to the x coordi- 
nate of position. Show that the correct formulas for the mean value 
of x result when the function G4(z,x’) is taken to be 


G,(2,2') = 28(z — 2x')d'(y — y’)6'(z — 2’) (5-51) 
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and the operator corresponding to x is simply multiplication by 7, 
that is, 
Xf(x) = af(z) (5-52) 


Eigenfunctions and Eigenvalues. The wave functions Xa,b,e,..., 
as discussed in Sec. 5-2, show a particularly simple behavior when sub- 
jected to the operation @. Thus 


QXa,b,c,... (x) = OX nbere (x) (5-53) 
Problem 5-11 Show that this last result is true. 


When a function x satisfies an equation such as (5-53), we say that 
x is an eigenfunction of the operator @ associated with the eigenvalue a. 

If two physical quantities can be simultaneously measured, then 
the operators associated with these quantities, @ and ®, for example, 
satisfy an interesting relationship, namely, @(®f) = @(@f). This 
relation says that the result of performing one operation after the 
other is the same regardless of the order in which the operations are 
performed. In this case the two operators are said to commute: 


@2°'8=8°@ 


In general, we cannot expect the commutation relation to hold 
between operators, but in this special case it does. The reason for this 
is that if A and B are physical quantities which can be measured simul- 
taneously, they can form part of a set A, B,C, . . . of simultaneously 
measurable quantities with a single characteristic function Xa be. 
If the operator © is substituted for @ and the value b is substituted 
for a in Eq. (5-53), the result is still valid, so 


@(@x) = @(bx) = b(@x) = bax = abx (5-54) 
which is true, since a, b are just numbers. Now also 
@(@x) = B(ax) = a(®x) = abx (5-55) 


A comparison of these two equations proves the commutation of the 
operators @ and ® when acting upon any of the functions xa,5,c,...- 
Since both these operations are linear (i.e., they do not involve compu- 
tations with higher powers of the function x), the commutation rela- 
tion must apply to any linear combination of the x functions. 

If the x functions comprise a “complete set” (which is typical) we 
can construct any function at all from such a linear combination. So 
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the operation AB and the operation BA give the same result on any 
function; that is, they commute. 


Problem 5-12 Show that the xz coordinate of position and the ند‎ 
coordinate of momentum are not simultaneously measurable quantities. 


There are situations in which a set of commuting mathematical 
operations ©, ©, €, . . . are already known and it is required to find 
the functions (the eigenfunctions) which are associated with them. 
This requires solving a set of equations such as 


QX = ax @x = bx Cx = cx eels (5-56) 
For example, suppose the operators for momentum in the 7, y, 2 
TA ; 289 0 O 
directions Pz, py, and p, are given as (h/2) 5 2/2) N (h/i) 3z 


What are the eigenfunctions of this set of operators corresponding to 
a state in which pz has the value ره‎ p, has the value b, and p, has the 
value c (these are, of course, the eigenvalues)? We must solve the 


equations 
hox _ hoax _ 0x 
E ا‎ E DIX (5-57) 


and the solution is some arbitrary constant times e“/*)@z+bytee), This 
agrees with our previous knowledge that a particle having a known 
momentum p has the wave function eA, 


Interpretation of Energy Expansion. Various expressions 
involving @¢,(x) can be interpreted more completely now. For 
example, consider the expansion in Eq. (4-59) of the kernel in terms 
of the solutions ¢, of a constant hamiltonian 


K (22,t2;21,t1) = Y bn(we) bn (aie WD Bala) (5-58) 


We notice first that (x) is the amplitude that if we are in state n, 
we are at position x. Therefore, from our previous discussion (cf. 
Sec. 5-2) $*(x) is the amplitude that if we are at x, we are in state n. 
Now let us interpret Eq. (5-58) this way. The amplitude to get 
from position 1 at time t; to position 2 at time te is the sum over alterna- 
tives. This time the alternatives will be divided into the various 
energy states in which the transition can be made. Thus we must sum 
over all of the states n the product of the following terms: 
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1. $*(x1), which is the amplitude that if we are at xı, then we are 
in the state n. 

2. eGA Enlt), which is the amplitude to be in state n at the time 
te if we are in the state n at the time tı. 

3. ġn(£2), which is the amplitude to be found at x when we know 
we are in the state n. 


Problem 5-13 Discuss the possibility of interpreting ¢,(x) as a 
Xa.b,c,... (x) function discussed in Sec. 5-2. That is, say ¢,(x) is the 
transformation function to go from the x representation to a represen- 
tation specified by n (energy representation). 


+ There is no amplitude to change the state. That is the importance of these 
particular functions Qn. 


6 


The Perturbation 
Method in 
Quantum Mechanics 


Ir a quantum-mechanical system is subjected to a potential which 
introduces only quadratic terms into the action, then we have seen in 
Chap. 3 how the resulting motion can be determined with the path 
integral method. However, many of the interesting potentials which 
arise in quantum-mechanical problems are not of this special type and 
cannot be handled so easily. In this chapter we shall develop a method 
of treating more complicated potentials. The method which we dis- 
cuss, called the perturbation expansion, is most useful when the poten- 
tial is comparatively weak (compared, for instance, to the kinetic 
energy of the system). ١ 

Although the perturbation expansion can be developed along 
strictly mathematical lines, it is capable of an interesting physical 
interpretation. This interpretation, which we shall also present, leads 
to a deeper understanding of quantum-mechanical behavior. 

In the second section of this chapter we shall undertake a special 
application of the perturbation method. We shall consider the motion 
of an electron when it is scattered by an atom. In describing the 
scattering interaction we shall find useful the classical notion of a cross- 
sectional area which the atom presents to the impinging electron. 
Although this area is related to the actual size of the atom, we shall 
find that its complete description depends upon the quantum-mechan- 
ical aspects of the interacting system. 


THE PERTURBATION EXPANSION 


The Terms of the Expansion. Suppose a particle is moving in a 
potential V (x,t). For the present, the motion will be restricted to one 
dimension. Then the kernel for motion between the points a and b is 


Ky(b,a) = f? 6 li he E i — veo | |) De(t) (6-1) 


The subscript notation Ky is used to remind us that the particle is in 
the potential V. From now on the notation Ko will denote the kernel 
for the motion of a free particle. 

In some cases the kernel Ky can be determined by the methods 
already studied. For instance, in Chap. 3 we determined the kernel 
for the harmonic oscillator subject to an outside force f(t). Here the 
potential was [see Eq. (3-65)] 


V(z,t) = 5 wnt — xf(t) (6-2) 
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In general, we have found that if the potential is quadratic in z, the 
kernel can be determined exactly, whereas if it is sufficiently slowly 
varying, the semiclassical approximation is adequate. There are some 
other types of potentials which can be successfully treated with the 
help of Schrédinger’s equation. Now we are studying a technique 
which is often useful if the effect of the potential is small. 

Suppose the potential is small, or more precisely, suppose the time 
integral of the potential along a path is small compared to 7. Then 
the part of the exponential of Eq. (6-1) which depends upon V (x,t) can 
be expanded as 


exp E ; , Ved 4| ie ; ibe V (x,t) dt 


+2) | f van al ++ (6-3) 


which is defined along any particular path z(t). 
Using this expansion in Eq. (6-1) results in 


Ky(b,a) = Ko(b,a) F K® (b,a) T K (b,a) PETA (6-4) 
where 
K,(b,a) = He | exp 6 A n a) | Da (t) (6-5) 


K® (b,a) = — f y ] مه‎ (; He 0 at) | 8 V[a(s),s] ds Da(t) (6-6) 


K®(b,a) = — a li [exp (i jä sit at) | jee V[x(s)] ds 
x f Vis(s'),s'] ds' Da) (6-7) 


and so forth. To avoid confusion in the integrals over V, we call the 
time variables s, s’, ete. 


Evaluation of the Terms. First consider the kernel K®. We 
wish to interchange the order of integration over the variable x and 
the path z(t). We write 


K(b,a) = - $ |? F(s) ds (6-8) 


where 


F(s) = ‘a [exp (j 7 i me a) | Viz(s),s] D(0 (6-9) 


a 
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Fig. 6-1 A particle starts from a and moves as 
a free particle to c. Here it is acted upon, or 
scattered, by the potential V[z(s),s] = Ve. 
Thereafter it moves as a free particle to b. 
The amplitude for such a motion is given in 
Eq. (6-10). If this amplitude is integrated 
over all possible positions of the point c, the 
result is the first-order term in the perturbation 
expansion. ١ 


x 


The path integral F(s) can be described as follows. It is the sum 
over all paths of the free-particle amplitude. However, each path 
is weighted by the potential V[z(s),s] evaluated at the time s. The 
only characteristic of the path x(t) which is involved in this particular 
V is the position of the path at the particular time t = s. This means 
that before and after the time s the paths involved in F(s) are the paths 
of an ordinary free particle. The situation is sketched in Fig. 6-1. 

Using the same arguments which led to Eq. (2-31), we divide each 
path into two parts, one before the time ¢ = s and one after this time. 
To be specific, we shall assume that each path goes through the point 
ze at this division time. Later on we shall integrate over all values of 
ze. If we denote the point xe(s) by c (that is, s = te), then the sum 
over all such paths can be written as Ko(b,c)Ko(c,a). This means that 
F(s) = F(t.) ean be written as 


F(t) = f”, Kolbe) V (waste) Ko(c,a) dze (6-10) 
Substituting this into Eq. (6-8) gives [with V(c) = V(a.,t.)] 


K®(b,a) = = K f = K(b,c) V(c)Ko(c,a) dze dt (6-11) 


Here the limits on the integral over x have been written as + œ. 
In a practical problem the limits will be established by the potential 
(which in most cases drops to 0 when x becomes very large) or by the 
equipment, which restricts the range of z. 


Interpretation of the Terms. Equation (6-11) is very important 
and very useful, so we shall develop a special interpretation to help 
think about it physically. We call the interaction between the poten- 
tial and the particle a scattering; thus we say that the potential scatters 
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the particle and that the amplitude to be scattered by a potential is 
—(i/h)V per unit volume and per unit time. 

With this interpretation we can describe Ky in the following way. 
Ky is, of course, a sum over alternative ways in which the particle may 
move from point a to point b. The alternatives are: 


1. The particle may not be scattered at all [Ko(b,a)]. 

2. The particle may be scattered once [K“(b,a)]. 

3. The particle may be scattered twice [K®(b,a)]. 
Etc. 


In accordance with this interpretation, the various paths of the parti- 
cles are diagramed in Fig. 6-2. 

Each one of these alternatives-is itself a sum over alternatives. 
Consider, for example, the kernel for a single scattering, K® (b,a). 


a (3) 


Fig. 6-2 In (1) a particle moves from a to b through the potential V without being scat- 
tered. The amplitude for this is Ko(b,a). In (2) the particle is scattered once at c as it 
moves through the potential V. The amplitude for this is K® (b,a). In (3) the particle is 
scattered twice with the amplitude K (b,c), and in (4) it is scattered n times, the last 
scattering taking place at c. The total amplitude for motion from a to b with any number 
of scatterings is Ko + KY + KO سل‎ --- PKM FE... 
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One of the alternatives which comprise this kernel consists of the 
following motion. The particle starts from a, moves as a free particle 
to the point Ze, te = c, is there scattered by the potential V(c), after 
which it moves as a free particle from the point c to the final position b. 
The amplitude for such a path is 


Ko(b,0) Vie) at, 4» | Ke (6-12) 


(It must be remembered that in the convention we are using the motion 
of the particle is traced by reading the formulas backward, i.e., from 
right to left.) 

The construction of this amplitude follows the rule stated in Sec. 2-5, 
namely, that the amplitudes for events occurring in succession in time 
multiply. The completed form for the kernel K® is obtained by 
adding up all such alternatives by integrating over x. and te, as indi- 
cated in Eq. (6-11). 

Using this reasoning, we can write down the kernel K for double 
scattering immediately as 


K (b,a) = (- i) 1 K (b,c) V (c) Ko(c,d) V (d) K o(d,a) dre dra (6-13) 


where dr = dxdt. Reading from right to left, this formula means: 
The particle moves as a free particle from a to d. At d the particle gets 
scattered by the potential V(d) at that point. It then moves as a 
free particle from d to c, where it is scattered by the potential V(c). 
After that it moves from c to b, again as a free particle. Wesum over 
all the alternatives, namely, all places and times that the scattering 
may take place. 

Here we have tacitly assumed that te > tz. In order to avoid the 
complication of having to introduce this assumption explicitly in each 
such example, we shall make use of the convention adopted in Chap. 4 
[cf. Eq. (4-28)] and assume 


K(b,a) = 0 torio GC (6-14) 


Then Eq. (6-13) is correct without any restrictions on the range of inte- 
gration of te and ta. 

The reader may wonder what happened to the factor 14 which 
appears in Eq. (6-7) but is apparently omitted in Eq. (6-13). Note 
that in Eq. (6-13) the range of integration for tg is still from ta to ta; 
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however, the range of t, has been restricted [by the definition of Eq. 
(6-14)] to lie between tg and t. This restriction cuts the value of the 
double integral in half. To see this more clearly, suppose the double 
integral of Eq. (6-7) is rewritten as 


Ihe 8 V[»(s),s]V[z(s’),s'] ae dee ibe ipa V[x(s),s]V[x(s"),8'] de!’ ds 
RE T is V[x(s),s]V[x(s’),s’] ds’ ds (6-15) 


The first term on the right-hand side of this equation satisfies the 
restrictions implied by Eq. (6-14). By interchanging the order of 
integration, the second term on the right-hand side can be rewritten as 


JE [ Vie, Ve") s'1 ds ds! (6-16) 


If the variables s and s’ are interchanged in the last expression, the 
value of the double integral remains the same. This means that the 
first and second terms on the right-hand side of Eq. (6-15) are equal, 
so each one is half the value of the original double integral. This same 
sort of argument will account for a factor 1/7! in an expression for K™). 


Problem 6-1 Suppose the potential can be written as U + V, 
where V is small but U is large. Suppose further that the kernel for 
motion in the potential of U alone can be worked out (for example, U 
might be quadratic in z and independent of time). Show that the 
motion in complete potential U + V is described by Eqs. (6-4), (6-11), 
(6-13), and (6-14) with Ko replaced by Kv, where Ky is the kernel 
for motion in the potential U alone. Thus we can consider V as a 
perturbation on the potential U. We can say that —(i/h)V is the 
amplitude to be scattered by the perturbed part of the potential 
(per unit volume and per unit time). Ky is the amplitude for the 
motion in the system in the unperturbed potential U. 


Problem 6-2 Suppose a system consists of two particles which 
interact only through a potential V(x,y), where x represents the 
coordinates of the first particle and y represents the coordinates of 
the second [cf. Sec. 3-8 and Eq. (3-75)]. Apart from this interaction, 
the particles are free. If V were 0, then K would be simply a product 
of the two free-particle kernels. Using this fact, develop a perturba- 
tion expansion for Ky(2s,ys,t0jta,Yata). By what rules of physical 
reasoning can the various terms in this expansion be described? 
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6-2 AN INTEGRAL EQUATION FOR Ky 


Before applying the results of the preceding paragraphs to a special 
example, we shall develop some mathematical relations involving the 
kernels and wave functions of systems moving in a potential field. 
Using the results so far obtained, we can write Eq. (6-4) as follows: 


Ky(b,a) = Ko(b,a) — * f Ko(b,e)V(e)Ko(c,a) dr. 
4 (- x) [| Kv, V@©Ko(c,a)V(@)Ko(d,a) dre dra + > -+ (6-17) 
Alternatively, this expression could be written as 
Ky(b,a) = Ko(b,a) — $ | Ko(b,e) V0) Ex 
-$ [ Ko(c,a)V(@)Ko(d,a) dra + ٠ 1 dre (6-18) 


The expression in the brackets has the same form as Eq. (6-17). In 
both cases the sums extend over an infinite number of terms. This 
means that Ky can be written as 


Kea) =k Baye م‎ f Kolbe) V()Kv(c,a) dre (6-19) 


which is an exact expression. This is an integral equation determin- 
ing Ky if Ko is known. (Note that for the situation described in 
Prob. 6-1, Ko would be replaced by Kv.) Thus the path integral 
problem has been transformed into an integral equation. 

This last result can be understood physically in the following way. 
The total amplitude for the transition of the system from a to b, 
with any number of scatterings, can be expressed as the sum of two 
alternatives. The first alternative is the amplitude that the tran- 
sition takes place with no scatterings, which is expressed by Ko. The 
second alternative is the amplitude that the transition takes place 
with one or more scatterings, which is given by the last term of Eq. 
(6-19). In this last term the point c can be thought of as the point 
at which the last scattering takes place. Thus the system moves from 
a to c in the potential field with its motion exactly described by 
Ky(c,a). Then at the point c the final scattering takes place, after 
which the system moves as a free system (without scattering) to the 
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(2) 


Fig. 6-3 In (1) the particle moves from a to b through the potential V as a free particle, 
described by the amplitude Ko(b,a). In (2) the particle is scattered one or more times by 
V, with the last scattering taking place at c. The motion from a to c is described by 
Ky(c,a), and that from c to b by Ko(b,c). A combination of the two situations, when all 
positions of c are accounted for, covers all possible cases and gives Ky(b,a) in the form of 
Eq. (6-19). 


point b, as represented by the kernel Ko. This interpretation is dia- 
gramed in Fig. 6-3. 

Since the last scattering could take place at any point in space 
and time between a and b, the amplitude for this composite motion, 
represented by the integrand of the last term of Eq. (6-19), must be 
integrated over all possible positions of the point c. 


Problem 6-3 For a free particle, Eq. (4-29) reduces to 


h? o? 
2m ðt? 


= 00 00 hay ل‎ tt btn ار‎ E 

b 

Show, from this result and Eq. (6-19), that the kernel Ky satisfies the 
differential equation 

2 2 

Be ie (h ) ee 


1 Oty 2m 02:2 Ky(b,a) Jr V(b) Ky(b,a) 


= hôte — La)ôlte — ta) (6-21) 


6-3 AN EXPANSION FOR THE WAVE FUNCTION 


In Sec. 3-4 we introduced the idea of a wave function and discussed 
some relations between wave functions and kernels. Equation (3-42) 
of that section shows how the wave function at the time 4 can be 
obtained from the wave function at an earlier time ta with the help of 
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the kernel describing the motion of the system between the two times. 
For our present purposes this equation can be written as 


¥(b) = JKv(b,a)f(a) dza (6-22) 


where f(a) is the value of the wave function at the time t = ta [that is, 
f(a) is a function of za], ¥(b) is the wave function at the later time 
t = &,f and we suppose that between the two times the system is 
moving in the potential V, with the motion described by the kernel 
Ky(b,a). 

If the series expansion [Eq. (6-18)] for Ky is substituted into this 
equation, the result will be a series expansion for ¥(b). Thus 


VO) = | Ko(b,a)f(a) dra — ; | Kolb,c) V(e)Ko(c,a) dre f(a) dza 


+ +++ (6-23) 


The first term of the series gives the wave function at the time t 
assuming the system to be free (or unperturbed, in case Ky is to be 
substituted for Ao) between the times ta and t. Call this term ¢. 
Thus 


$(b) = [Ko(b,a)f(a) dza (6-24) 


Using this definition, the series of Eq. (6-23) can be rewritten as 
V) = $b) - z | Kolb,c)V(c) (0) dre 
F 7 i i K (b,c) V(c)Ko(c,d) V(d) 9)0( dre dra + ٠ ٠ ٠ (6-25) 


In this form the series is called the Born expansion for y. If only 
the first two terms are included (thus only through first order in V), 
the result is the first Born approximation. It involves a single scatter- 
ing by the potential V. This scattering occurs at the point c. Up 
to this point the system described by ¢(c) is free; and after the scatter- 
ing, the system moves from c to b, again free, and described by K o(b,c). 
An integral must be taken over all the possible points at which the 
scattering occurs. If three terms of the series are used (thus through 


terms of second order in V), the result is called the second Born approxi- 
mation, etc. 


t Note that our convention that K(b,a) is zero for t < ta makes Eq. (6-22) 
invalid if t, < ta, but we shall not use it in this range of ¢ values. 
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Problem 6-4 Using arguments similar to those leading to Eq. 
(6-19), show that the wave function y(b) satisfies the integral equation 


vb) = 60) — È f Kub,0VOVO dre (6-26) 


This integral equation is equivalent to the Schrédinger equation 


hoy , fh’ _, es 

mae ا‎ U (6-27) 
Working in one dimension only, show how the Schrödinger equation 
may be deduced from the integral equation. 


THE SCATTERING OF AN ELECTRON BY AN ATOM 


Mathematical Treatment. We have developed the concepts and 
formulas of the perturbation treatment in a somewhat abstract frame- 
work. Now, to develop a physical understanding of the perturbation 
method, we shall discuss the specific problem of the scattering of a fast 
electron by an atom. We envision an experiment in which a beam 
of electrons bombards a target, such as a thin foil of metal, and then is 
collected by some suitable counter, as shown in Fig. 6-4. 


Fig. 6-4 Electrons boil off a hot filament at a, are screened into a beam by collimating 
holes in s and s’, and then strike a thin-foil target at O. Most of the electrons pass straight 
on without being scattered (if their energy is great enough and the target is thin enough), 
but some are deflected by interactions with atoms in the target and scattered, for example, 
through an angle 0 to b. As the counter at b is moved up and down, the relation between 
the relative number of scatterings and the scattering angle 0 can be measured. 
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Suppose the energy of the scattering particles is determined by a 
time-of-flight method. That is, we release an electron from the source 
at one time, say, t = 0, and ask for the chance that it arrives at the 
counter after some delay T. We can then make direct use of our result 
for the amplitude K(b,a) to go from one place to another in a definite 
time. ١ 

We shall simplify the problem by assuming that either the foil is 
so thin or the interaction is so weak that each electron can interact 
with, at most, one atom. Actually, this assumption is quite realistic 
for many scattering experiments. Furthermore, most multiple scat- 
terings can be analyzed in terms of the simple scattering from one atom. 
Thus we shall discuss the interaction between a single electron and a 
single atom. 

The center of the atom will be taken as the center of a coordinate 
system in which the electrons are released at the point a, as in Fig. 6-5, 
at the time ¢ = 0. A counter placed at the point b tells us whether 
or not, at the time t = T, the electron arrives at the point b. We 
shall make the following approximations: 


1. The interaction can be represented by a first-order Born approx- 
imation. That is, the electron is scattered only once by the atom. 

2. The atom can be represented by a potential V(r) fixed in space 
and constant in time. 


Actually, the atom presents a very complicated system interacting 
with the electron, and the interaction between the electron and the 
atom is really more complicated than can be represented by a simple 
potential V(r). The electron could excite or ionize the atom and lose 
energy in the process. It can be shown, however, that if we con- 
sider only elastic collisions between the electron and the atom, so 
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Fig. 6-5 The geometry of the scattering problem. The electron starts at a and moves as 
a free particle to c, where it is scattered by the atomic potential V(r). After the scattering, 
it moves to the counter at b, which is located at the end of the radius vector R, from the 
scattering center O. In this process, the electron has been scattered through the angle 9, 
measured from the direction of the nonscattered beam. ‘This process corresponds to the 
first-order Born approximation. If the amplitude for two scatterings, say, at c and d, is 
included, then the result is the second-order approximation, etc. 
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that the atom is in the same energy state after the collision as it is 
before, then when the approximation (1) is valid, approximation (2) 
is valid too. 

Let Ra and R, be the vectors from the center of the atom to the 
points at which the electron is released and detected, respectively. 
In the calculations we shall take R, and R, to have bata: much larger 
than the radius of the atom. That is, we shall assume that the atomic 
potential V(r) becomes negligibly اليك‎ at distances much smaller 
than |R,| and |R,|. Thus during most of its flight, the electron will be 
moving as a free particle, and only in the vicinity of the origin will 
it be exposed to the potential. 

The first-order Born approximation contains two terms, only the 
second of which is of interest to us here. The first term is the kernel 
K o(b,a) for the motion of the electron from a to b as a free particle, and 
it has already been studied sufficiently. The term of interest is then 


Cn aye ^ f Kolbe) V()Ko(ea) dre 


a drec 0 [exp E 1 | 
x ieee [ex S d'r dt (6-28) 


Here we have used r as the vector from the origin to the point c, and 
dîr represents the product of the differentials of all the components 
of the vector r. The integral over ¢ gives (cf. Appendix) 


if m \% fal 1 
K® (b,a) ڪڪ‎ ii (zar) Tf G + A 


x {exp Ee ree Dal V(r) dîr (6-29) 


: 


where ra = |Ra — r| and r, = |R, — r|. Using these definitions, we 
write 
. 214 
r= ).م‎ - t+) Ra E1. TF (6-30) 
RT ماح‎ : 
re = Ro (.- apa + i) = Ro — rT (6-31) 


where i, and i, are unit vectors in the direction of —R, and Rs, respec- 
tively (that is, ia = —R./R., where Ra = |Ra|), and we have made 
use in the approximation of the fact that R, is much larger than any 
value of |r| for which the potential is not negligible. It is necessary to 


Quantum mechanics and path integrals 


keep the first-order terms in r only in the exponential factor, since 
this factor is quite sensitive to small relative changes in phase. Here 
we need 


(Ta + 1»)? ~ (Ra 3 Ry)? S 2(Ra + Ro) (ia GE OF ip 1 r) (6-32) 


Using these approximations, the kernel can be written as 
) 3 1 1 im 
4 الب كذ ل م‎ Biss 2 
SERA Neda (sar) 86 m) {exp Ee itech the) | 
x Ja {exp E (Ra + Ro) (iaer — ise د‎ V(r) dêr (6-33) 


Physical Interpretation. We can deduce some of the physical 
characteristics of the motion from a study of Eq. (6-33). In the 
time T the electron has traveled the total distance of Ra + Ry. Thus 
its velocity during this time is u = (Ra + R;)/T and its energy is 
mu?/2, while its momentum has the magnitude mu. In writing these 
expressions we are making the assumption that the energy of the 
electron is not changed by the scattering process. 

That these values for the velocities, energy, and momentum are 
consistent can be verified from an inspection of the exponential factor 
appearing in front of the integral of Eq. (6-33). The phase of this 
exponential term is im(R, + R,)?/2KhT, and the derivative of this 
phase with respect to T gives the frequency as 


Ra + R)? 
a الت‎ (6-34) 


With u defined as above this means that the energy is mu?/2 [cf. Eq. 
(3-15)]. 

Differentiating the phase with respect to F» yields the wave number 
at the point b as 


m Ra + R 
ا‎ EE gE )6-35( 


which means that the magnitude of the momentum is mu [cf. Eq. 
(3-12). 


Problem 6-5 The integral over ¢ of Eq. (6-28) could be approxi- 
mated by the method of stationary phase. By studying the applica- 
tion of such a method to this integral, show that most of the contribu- 
tion to the integral comes from values of ¢ near the region t = R,/u, the 
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time at which the electron would arrive at the center of the atom if it 
moved in a classical manner. 


With the velocity of the electron defined as u = (Ra + Rs)/T 
define the incoming vector of momentum p, as 


Pa = muia (6-36) 
and the outgoing vector momentum p» as 

Pe = mui (6-37) 
Then Eq. (6-33) can be written as 


k 5 é 
K®(b,a) = — 5 (a) TERR: [exp 0 1) | 
X fe exp E (Pa — Po) ° || V(r) dîr (6-38) 


Call the change in momentum, or the momentum transfer, 


q = Po — Po 
and define the quantity v(q) as 
v(q) = ji " elMarV (r) dîr (6-39) 


The probability that an electron arrives at the point b is given 
by the square of the absolute value of the kernel Ky(b,a). Thus the 
probability will depend upon the first term in the series expansion of 
this kernel, namely, Ko(b,a), which is likely to be so large as to com- 
pletely overshadow the small perturbation term K“(b,a). 

For this reason it is customary in most scattering experiments to 
collimate the incoming beam with suitable shields so that those 
electrons which are not scattered by the atoms in the target are con- 
fined to the region of a particular line (or direction), as shown in Fig. 
6-6. Of course, there will be some diffraction by the collimating 
shields, such as that studied in Secs. 3-2 and 3-3, which means that 
some nonscattered electrons will appear outside this central beam. 
However, with suitable collimation, and for positions suitably far away 
from the collimated beam, the number of electrons diffracted by the 
collimator will be very small compared to the number scattered by 
the atoms in the target. 

In such a region the probability of arrival for an electron is given, 
at least to first order, by the square of the absolute value of K® (b,a) 
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alone. Using Eqs. (6-38) and (6-39), this probability is 


P(b) ae ly Nee ee Hai 
unit volume 1 G5) TRR; lv(q)| (6-40) 


In this last expression the factor v(q) contains the characteristics 
of the atomic potential and the dependence of the kernel upon the 
relative directions of R, and Rə. It is completely independent of 
the dimensions of the experimental equipment. The effects of such 
dimensions are represented by the remaining factors of Eq. (6-40). 
For example, the term 1/R,? can be easily seen to result from the idea 
that the chance for an electron to actually hit the atom varies inversely 
as Ra. The application of such an idea might be questioned in this 
experiment in view of the fact that we have supposed some collimating 
shields are present. However, this collimation has a negligible effect 
over atomic dimensions. From the point of view of a target atom 
the beam of oncoming electrons appears to consist of electrons spread- 
ing in all directions from a single source. 

In a similar manner, after the scattering, the electrons spread out 
again in all directions from the scattering atom. Thus the chance per 
unit volume to find an electron in the counter varies inversely as Ro. 
Since the more interesting features of the experiment are contained in 
the function v(q), we shall give special attention to this function in the 
next section. 

The additional factors depend on the particular normalization of 
our kernel. We can interpret the formula more easily if we give it as 
a ratio. We compare the probability of finding a scattered particle 
at b to the probability of finding one at a point d behind the atom at 
the same total distance Ra + Fe (and at the same time T, to keep the 
velocity the same) if no scattering occurred, as shown in Fig. 6-7. 
That is, we calculate P(d) per unit volume, as if no atom were present. 
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Fig. 6-6 Principle of collimation to eliminate the zero-order term at b: Only electrons 
which have been scattered at least once can get from a to b with any reasonable probability. 
Thus the zero-order term in the perturbation expansion of Ky(b,a) will contribute a negli- 
gible amount and can be neglected. The first term of importance is K“(,a). 
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Fig. 6-7 If d and b are the same total distance from O, namely, Re, then the difference 
(or ratio) of numbers of electrons arriving at the two points can depend only on the scat- 
tering phenomenon. If d is in the direct line of nonscattered electrons, the ratio of the 
number arriving at b to that arriving at d if no scattering source were present is the proba- 
bility of scattering to b. 


The result is |Ko(d,a)|? or 


P(d) M ENE u? 
unit volume  \2rÃ/ T(Re + Ro)? (6-41) 
so that 
P(b) m \2 R, + R:)? 
P - مهم‎ Enh Ra? ie (6-42) 


We shall interpret the last factor geometrically in the next section, 
where we shall also give more detailed attention to the function v(q). 


The Cross Section for Scattering. It is convenient to describe 
the characteristics of an atom in a scattering experiment by means of 
the concept of a cross section. The utility of such a concept stems 
from the convenience of thinking along the lines of classical physics. 
The cross section e is defined as the effective target area (from a 
classical point of view) of the atom that must be hit by an electron 
in order that the electron be scattered into a unit solid angle. This 
solid angle is measured around a sphere whose center is at the atom. 
The cross section is thus a function of the scattering angle, i.e., the 
angle between R, and Rẹ. In terms of such a classical model we can 
determine the probability that an electron arrives at the point b. 

If particles starting from the origin were to hit a small target of 
area do at distance Ra, these particles would be removed from the 
region d, where they are spread out over an area [((Ra + Be) / Ra]? do. 
Instead they are sent out in a solid angle 92 toward b and are therefore 
spread out over an area R? d9 there, as shown in Fig. 6-8. Hence 
the ratio of the probability of finding them at b to that of finding them 
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Fig. 6-8 Particle striking an area do of the target are deflected through an angle @ into an 
area measured by the solid angle dQ. If no target had been present, the particles would 
have proceeded to the point d. Instead, they proceed to the point b, spreading out into 
the area R,? dQ. The probability of finding a particle at d is inversely proportional to the 
area over which the beam would have spread in arriving atd. Similarly, the probability of 
finding the particle at b is inversely proportional to the area R»? dQ over which the beam of 
scattered particles spreads in traveling from the target to b. If we take the ratio of these 
areas, we have the inverse ratio of the associated probabilities. From this point of view 
we say that all of the particles which hit the target area do are scattered, and through a 
particular angle 6. Of course, actually only a few particles which hit the target are scat- 
tered at all and only a fraction of these through the angle 6. Thus, the area element de 
which we have used in this calculation is the effective cross-sectional area for scattering 
through the angle 6 measured in terms of the element of solid angle dQ into which the 
particles are scattered. 


at d if there were no target is the inverse ratio of these areas, 


P(b) _ (Ra + Ro)? do/R? 
Pay Ro? dQ SAB) 


On comparing Eqs. (6-42) and (6-48), we see that the cross section per 
unit solid angle is 


a= (sn) TO} (6-44) 


The main advantage of an expression in terms of cross section 
instead of using Eq. (6-40) directly is this: Equation (6-44) does not 
depend on particular experimental conditions, so the cross sections 
obtained in one or another experiment can be directly compared, 
whereas probabilities per unit volume cannot be. 

It must be emphasized that this idea of an effective target is purely 
classical and is convenient in recording scattering probabilities. There 
is no direct relation between it and the size of the atom, nor is the 
scattering mechanism to be thought of as localized exactly over such 
anarea, For example, the shadow one would expect to find classically 
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behind the target is not there in the classical sense (with sharp bound- 
aries); for since we are dealing with a wave phenomenon, there is 
diffraction into the shadow. 


Special Forms of the Atomic Potential. The results obtained 
when the atomic potential V(r) is assumed to have various forms are 
shown in the following problems. 


Problem 6-6 Suppose the potential is that of a central force. 
Thus V(r) = ae Show that v(q) can be written as 


na جو‎ —— a r (sing g) V(r) dr (6-45) 


Suppose V(r) is the Coulomb potential Ze?/r. In this case the integral 
for v(q) is oscillatory at the upper limit. But convergence of the 
integral can be artificially forced by introducing the factor e7“ and 
then taking the limit of the result as_e— 0. Following through this 
calculation, show that the cross section corresponds to the Rutherford 
cross section 


4Ze*m? Zet 
aun = gk ~ Tm? 72)sin 7T عد‎ 


where e = charge on an electron 
q = 2p sin (6/2) = 2mu sin (0/2) (6-47) 
0 = angle between the vectors ig and i, 


The result of Prob. 6-6 is, accidentally, exact. That is, the first- 
order Born approximation gives the exact value for the probability 
of scattering in a Coulomb potential. This does not mean the higher- 
order terms are zero; it means, rather, that they contribute only 
to the phase of the scattering amplitude. Since the probability is the 
absolute square of the amplitude, it is independent of the phase. Thus 
a first-order Born approximation, which gives the correct value for the 
probability, is not exact for the amplitude. This case of a Coulomb 
scattering is amusing, for there is also another accident. A completely 
classical treatment of such a scattering problem, i.e., treating the 
electrons as charged point masses, gives the same result. 


Problem 6-7 Suppose the potential V(r) is the result of a charge 
distribution p(r) so that 


VV (r) = 4rep(r) (6-48) 
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By assuming that p(r) goes to 0 as |r| > ©, multiplying Eq. (6-48) 
by exp (iq - r/ñ), and integrating twice over r, show that v(q) can be 
expressed in terms of p as 


252 fr 
v(a) = SEES f esimer(e) dr (6-49) 


An atom can be represented in terms of its charge density. At 
the nucleus the charge density is singular, so that it can be represented 
as a 6 function of r of strength Z, where Z is the charge on the nucleus. 
Then if pe is the density of atomic electrons, v(q) is 


arr | (6-50)‏ توووم FEES z — fT‏ = روم 


The quantity in the brackets is called the form factor for electron 
scattering. (Incidentally, a similar form factor appears in X-ray 
scattering. The theory of X-ray scattering shows that only the atomic 
electrons, and not the nucleus, contribute to the scattering. Thus the 
form factor for X-ray scattering is the same but with the Z omitted.) 

In an atom the potential follows the Coulomb law only for very 
small radii. As the radius is increased the atomic electrons gradually 
shield, or cancel out, the nuclear charge until, for sufficiently large 
values of r, the potential is zero. The shielding effect of atomic 
electrons can be accounted for in a very rough approximate manner 
with the formula 


2 
V(r) = ew (6-51) 


In this expression a is called the radius of the atom. It is not the 
same as the outer radius of the atom as used by chemists, but instead 
is given by ao/Z*, where ao = ň?/me? = 0.528 A. 


Problem 6-8 Show that in such a potential 
4nZe*h? 


v(q) = T + A/a) (6-52) 
and hence 
o = 4 g E (sin 5) + oll (6-53) 


The total cross section or is defined as the integral of o over the unit 
sphere; thus 


år 
or = | odg (6-54) 
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In the present example show that ep is given by 
Z*e*/(2Quh)? 
= ا‎ 
or = ma? 5  #/ Cpa)’ )6-55( 


Problem 6-9 Suppose we introduce the fact that the atomic 
nucleus has, a finite radius given by 


r = 1.2 X 10-13 X (mass number)” em (6-56) 


and assume that the nuclear charge is distributed approximately 
uniformly in a sphere of this radius. What is the effect of this assump- 
tion on the cross section for the scattering of electrons by atoms at 
large values of the momentum transfer q? 

Show how the nuclear radius can be determined along with some 
of the details of the nuclear charge distribution by making use of this 
effect. How large must the momentum p of the incoming electrons 
be in order to produce an appreciable effect? Would one observe 
more carefully the large or small scattering angles? Why? 

Note: In this type of experiment the required electron momentum is 
so high that actually the relativistic formula E = VRC TCD mc? 
must be used to find the energy. So, strictly, we should not be 
allowed to use nonrelativistic formulas to describe the interaction. 
However, the relations between momentum and wavelength and 
between energy and frequency are not changed in the relativistic 
region. Since it is the wavelength which determines the resolving 
power of this “electron microscope,” the momentum calculated by 
nonrelativistic formulas is still correct. 


Problem 6-10 Consider a diatomic molecule containing two 
atoms, A and B, arranged with their centers at the points given by the 
vectors a and b. Using the Born approximation, show that the ampli- 
tude for an electron to be scattered from such a molecule is 


K® = etraf, (q) + ef (q) (6-57( 


where fa and fg are the amplitudes for scattering by the two atoms 
individually when each atom is located at the center of a coordinate 
system. The atomic binding does not change the charge distributions 
around the nuclei very much (except for very light nuclei such as 
hydrogen) because the binding forces affect only a few of the outermost 
electrons. 
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Using Eq. (6-57), show that the probability of scattering at a par- 
ticular value of q is proportional to fa? + fs? + 2fafs cos (q - d), where 
dis a — b. 


The values of f computed by the Born approximation are real and 
the result is valid for electron energies usually used in diffraction 
experiments on molecules (the order of 1 kev). However, if the 
molecule includes the very heaviest atoms, such as uranium, the atomic 
potential is too large for the results to be adequately described by the 
Born approximation, and small corrections are necessary. 


Problem 6-11 Suppose the molecules are oriented in a random 
fashion. Show that the electron scattering averaged over a group 
of such molecules is proportional to fa? + fs? + 2fafe (sin q ° d)/q-d. 
How can this result be generalized to the case of polyatomic molecules? 


These results form the basis of electron diffraction techniques which 
make possible the determination of the form of molecules. 


Problem 6-12 Assume that V(r) is independent of time and show 
that the time integral of the second-order scattering term K“(b,a) 


gives 
dle فق‎ EN ee لات‎ 
K (b,a) (5) QrihT i J TedT acl bd 


x | exp (srr) lra Paras na)? V(r.) V (ra) dîr. dra (6-58) 
where the points a, b, c, and d are arranged as shown in Fig. 6-9. The 
term reg stands for the distance between the point c and d, etc. 

Assume that V(r) becomes negligibly small at distances which are 
short compared to R, or Ry. Show that the cross section is given by 
g = |f|?, where the scattering amplitude f, including the first-order 
term, is 


0 eh) por V (r)e il) Por dîr 


m 2 Te Yd 3 1 6 
aes —(i/A) port, s= CIA) pre 
as ف‎ i) 1 g~ (il peta V (ra) (=) eli prea 


- V(r.)e/ Pate d3r, d'ra + higher-order terms (6-59) 


m 
f= oR 


where p, is the momentum of the electron traveling in the direction of 
R, and p, is the momentum of an electron traveling in the direction 
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Fig. 6-9 To increase the accuracy of scattering calculations, we can take account of 
second-order terms in the perturbation expansion. Here, as in Fig. 6-2 (3), we picture the 
electron as being scattered at two separate points in the atomic potential. Thus the elec- 
tron starts at a; proceeds as a free particle to c, where it is scattered; then moves as a free 
particle to d, where it is scattered again; and finally moves as a free particle to b, where it is 
collected by the counter. The points c and d can lie at any position in space. The atomic 
potential at these positions depends upon the radius vectors r. and ra, measured from the 
center of the atom O. 


—R,. The magnitude of the momentum is p, and it is approximately 
unchanged by an elastic scattering of the electron from the (relatively 
massive) atom. 


One might expect that in a situation in which the Born approxima- 
tion is not adequate it would be worthwhile to compute the second- 
order term as a correction. But in practice it seems that in this 
application Eq. (6-59) is a kind of asymptotic series. If the second 
term makes an appreciable correction (say 10 per cent or more) the 
higher terms are not much smaller and the true correction cannot be 
gotten easily by this method. Of course, if it is a problem in which 
the errors of the Born approximation are small (say less than 1 per 
cent), the second term will be adequate to find the corrections. 


The Wave Function Treatment of Scattering. In the scatter- 
ing experiment which we have described we have assumed that the 
initial state of the incoming electron was that of a free particle with 
momentum pa. We have assumed that the value of the momentum is 
determined by a time-of-flight technique (i.e., the total time required 
to travel the distance Ra + Fè is T). 

It is not necessary to use such a technique. Any device which 
enables us to determine the momentum is equally satisfactory. So 
suppose we generalize our picture of scattering phenomena with the 
help of the wave function method. 

Suppose the incoming electrons are known to have momentum pa 
and energy E. = Pa?/2m. Thus the wave function for the incoming 
electrons is 
ba = etl Pate CNN Eat (6-60) 
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Then, using the first two terms of Eq. (6-25), the wave function to 
first order for the outgoing electrons is 


W(Ro,ty) = eC Pa Rogli) Bots 


= 1 fè [E Ratt dV (erreme dar dt (6-61) 


The first term in this expression represents the wave of free particles 
which have passed through the potential region without being scat- 
tered. The second term is the amplitude of the scattered electrons. 
If y, represents this term, then y, is the scattered wave. 


Problem 6-13 Assume V(r,t) is actually independent of t. Sub- 
stituting for the free-particle kernel Ko in Eq. (6-61), integrate the 
result over ¢ to show that 


V(Rs,fo) = el Bots اا‎ 


ae IT 1 ze + g(ilD proe V (ra) افاج‎ 70 Porte dr. | )6-62( 
where rs. is the distance from the final point b to the variable point of 
integration c and p is the magnitude of the momentum of the electron. 

Once more, suppose that the potential drops to 0 for distances 
which are short compared to either Ra or Rẹ. Show that Eq. (6-62) 
can be written as 


l : gil) مغلم‎ 
Y(Ro,t) = elDBstgilDpa'Rs 4. f )6-63( 
b 
where the scattering amplitude f is defined in terms of v(q) [see Eq. 
(6-39)] as 
m 
f= x55 0(a) (6-64) 


The last term of Eq. (6-63), (f/ R») exp (ipRs/h), can be thought 
of as the spatial part of the scattered wave function. It has the form 
of a spherical wave radiating outward from the center of the scattering 
atom. The amplitude of this spherical wave at some particular 
scattering angle depends upon that angle through the function ‘bp 
which, by Eq. (6-64), varies with the momentum transfer q. Thus 
the complete wave functions for the electrons after scattering can be 
thought of as the sum of two terms. The first term is the plane wave 
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Fig. 6-10 A beam of electrons, represented by its equivalent wave, moves toward the 
atomic nucleus at O. The strongest fraction of the beam moves on undisturbed as a plane 
wave with momentum Pa. A small fraction of the beam is scattered from the atomic 
nucleus and moves away from O as a spherical wave pattern. The resulting strength of 
the wave of electrons (the number of electrons) at some point b located at R,, measured 
from the atomic nucleus O, is then made up of two parts. The first is the nonscattered 
beam given by the plane wave exp (ipa * R,/f). To this is added the scattered wave with 
the spherical form given by (1/R,) exp (¢pR,/h) and multiplied by the function f, which 
determines the angular dependence. The combination of these two waves gives the 
spatial part of the scattered wave function. 


of nonscattered electrons, exp (ipa- R;/f), and the second term is the 
spherical wave of scattered electrons, as indicated in Fig. 6-10. Use 
this point of view to derive the formula for the cross section 7. 


Problem 6-14 Use the wave function approach to discuss the 
scattering of an electron from a sinusoidally oscillating field whose 
potential is given by 


V(r,t) = U(r) cos wt (6-65) 


Show that in the first-order Born approximation the energy of the out- 
going wave is changed by either +w or —w. What happens in the 
higher-order terms? 


6-5 TIME - DEPENDENT PERTURBATIONS 
AND TRANSITION AMPLITUDES 


The Transition Amplitude. An especially useful form of the 
perturbation theory occurs if the unperturbed problem corresponds to a 
potential U independent of time, for then we have seen in Eq. (4-59) 
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that the unperturbed kernel can be expanded as (now in one dimension 
for convenience) 


Ku(2,1) = إلا‎ bale) عو‎ (T1) FIM for te > tı (6-66) 


in terms of the eigenfunctions $n and eigenvalues E, of the unperturbed 
problem. Let us look at our series for Ky(2,1) after substituting this 
expression for Ky. Writing out the first two terms, it is 


Ky(2,1) = 0 hn(L2) b* (x1) سج‎ CEA (tt) 


- 5D, J f nlad oes) V (Eata) Po! 9 by (5) (2) 
X e~ GEnh tt) dx, dta + ° ° ° (6-67) 


It is clear that in each term the zı will appear in some wave function, 
like $*(x1), and the zə likewise, so we can always write Ky in the form 


Ky(2,1) = >) > و‎ bm (22) 6* (a1) (6-68) 


where the )’s are coefficients depending on fe, tı. We shall call these 
coefficients transition amplitudes. To zero order in V, this must reduce 
to Ky, so to this order Amn = mne CEY, If we expand A in a 
series in increasing orders of V, we have 


Amn = Sinn COAL (et) 7 EAL: Sr A => SPs (6-69) 
and comparison to Eq. (6-67) shows 


Ama) = — ; 1 "i fe ož (23) V (3,3) Pn (23) dzz eC Enlt) -Ent dt, 
(6-70) 
Problem 6-15 Recall that in Prob. 5-4 we defined a particular 
integral as the transition amplitude to go from state y(x) to state 
x(x). Show that the function Amn satisfies this definition when the 


initial state is the eigenfunction ¢,(x) and the final state is the eigen- 
function ¢,,(2). 


Define, for brevity, 


Van(t) = |, 04E) V (ata) dnl) dzs (6-71) 
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(This is sometimes called the matrix element of V between states n and 
m.) Then Eq. (6-70) can be written as 


t EFP g tz : 
Amn’) = — z“ (G/N E mtret (ilà) Enti i V mn(ta)e C/A (Em—En)ts dts )6-72( 


This is an important result of the time-dependent perturbation theory. 
The coefficient Amn is the amplitude for the system to be found in state 
m at time ta if initially the system is in state n. 

Suppose the wave function at tı was ¢„(xı). What is it at to? 
Using Eq. (3-42), we can express the wave function at tz as 


K4 Kv(2,1)%n(zı) dai = > > \rr Pr (z2) T3 $7 (21) (رنه) ره‎ dzı 
ko 


= D )ف ا‎ (6-73) 
k 


That is, the wave function at tə is in the form 2 Cmbm(X2). 


This expansion in terms of eigenfunctions was first introduced in 
Eq. (4-48). Now we can assign a deeper meaning to the constants Cm. 
We can interpret Cm as the amplitude that the system is in state dn. 
In this particular case, Cm = Amn is the amplitude to be in the state 
$m at time tz if at tı the state is dp. 

With no potential acting, a system once in state n is always in 
state n with an amplitude varying in time. So, to zero order, 


Aan = Sn EnA (ts—t) 


We can interpret the first-order term by the rule (Fig. 6-11): The 
amplitude to be scattered from state n to m in a time dt is —(t/h) V mn(t) dt. 


Fig. 6-11 A system initially at 
the nth energy level is subjected 
to a potential V which ‘“scat- 
ne ters” the system into all of the 
states available to it. The 
amplitude for a scattering into 
the kth state, for example, is 
proportional to Vin. In par- 
^-1 ticular, the amplitude to be 
scattered from the state n to 
the state m in the time interval 
dt is —(i/h) V mn(t) dt. 


Initial state Final state 
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Problem 6-16 Interpret Eq. (6-71) as a sum over alternatives; 
i.e., identify the alternatives. 


Problem 6-17 Interpret Eq. (6-72) by explaining the meaning of 
each term. Then explain and verify the equation for the second-order 
coefficient : 


1 te t4 1 
لدي هك ور‎ Sa — (il) Bm (tots) 
Amn 72 ip [ 2 6 V mx (ta) 
X فاسع‎ Bate) Vpn (t4)e (iD Balt) dt | dt, (6-74) 
Problem 6-18 Derive and interpret the integral equation 


Nmn(to,t1) = mnel En(t—ty) 
z F Ra > V mu (ts)Nen(ts,t1) dts (6-75) 


k 


Problem 6-19 Consider )ymn(t2) as a function of the final time tz. 
Show, using either Eq. (6-75) or Eq. (6-69), 


d 2 À : 2 
کے کے خد‎ ee (i/h) ) 37 ليم‎ pta V اك‎ S 
dts Amn (te) h : 6 t 1 mk (t2) Nn (te) h Em\mn (te) (6 76) 


Give a direct physical interpretation of this result. Next, deduce 
this result directly from the Schrödinger equation. Hint: Use Eq. 
(6-73) and substitute into the Schrödinger equation. Note that 
Eq. (6-76), with the initial condition Amn(t1) = ômn, could be used to 
determine the X's directly. 


We can interpret all of the terms in Eq. (6-69) by the rule that 
—(t/h) V mn(t) dt is the amplitude that the potential V will scatter (or 
induce a transition) from a state n to a state m during the time interval 
dt. We can go from the state n to the state m DVO E E 0 
more scatterings. We can go directly between the two states, Le., 
with no scatterings, only ifm = n. Thus the first term in the expan- 
sion is proportional to ômn. 

The second term, given by Eq. (6-72), gives the amplitude that 
the transition will take place as the result of a single scattering. 
The amplitude for the particle to be found in the initial state n is 
exp [—iE,(ts — t:)/A] at the time ts. (In this case the phrase “to be 
found in the state n” should be interpreted as “available for scattering 
from the state n by the potential V.”) The amplitude to be scattered 
by the potential V between the state n and the state m is (—t/h) Vinn- 
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Finally, the amplitude to be found in the state m (which in this case 
means “the amplitude that the state m shall be available to the particle 
at the time the scattering takes place”) at the time t; is proportional 
to exp [—iEm(t2 — t3)/h]. This scattering can take place at any time 
between tı and ¢. Therefore, an integration over the time t; is earried 
out between these two end points. 

The third term, given by Eq. (6-74), is the amplitude for a transi- 
tion as the result of a double, or second-order, scattering. The 
first scattering takes the system from its initial state n to the inter- 
mediate state k at the time t}. The system then stays in this state 
until the time t4, when its availability for scattering is again meas- 
ured by an exponential function, exp [(—72/h) E(t, — ts)]. Another 
scattering takes place at the time-t, and carries the system from the 
state k to the state n. We integrate over all of the possible alternate 
times for the scatterings t4 and ts, requiring only that the time ts shall 
be earlier than the time ty. Next, we add over all the possible states 
k into which the system may have been scattered in the intermediate 
interval. 

The terms of Eq. (6-69), which we have just interpreted, give the 
results of the general time-dependent perturbation theory. Itis 
applicable when the unperturbed system has a constant hamiltonian and 
thus definite energy values. Next, we shall study some special cases 
of this theory in more detail. 


First-order Transitions. First, let us take the case that the final 
state m is different from the initial state n and let us consider only 
the first Born approximation, i.e., the second term in Eq. (6-69). 
The result will be applicable for small values of V. The amplitude 
that we make the transition from m to n is 


Lan? = — = i! a eG) Bn-Emt V(t) dt e (i (Ents Eat (6-77) 
ti 


This is a very important special formula in the time-dependent pertur- 
bation theory. Suppose as a first example that V (x,t) = V(x) is not 
an explicit function of time. If we take the interval of time from 0 to 
T, then since Vmn is constant, we have 


Xn DE!) Ent Ent) = — ; Vx f T SGD EnEn)t dt 
0 


eCIAEn—EmdT — 1 


=) e E 6-78 
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F(t) 


0 4 
if 


Fig. 6-12 The potential affecting the transition from m to n is turned on and off slowly 
with the time variation f(t), shown here. As this time factor becomes smoother (e.g., as 
discontinuities appear in successively higher derivatives) the probability of a transition 
becomes smaller. 


The probability of a transition during the time interval T is then 


Pee ny a poe ar فر‎ E int Get | (E, — En)? 
(6-79) 


2h 

We see that for at least a long interval T this probability is a rapidly 
oscillating function of the energy difference E, — Em. If E, and En 
differ appreciably, it is very small, i.e., if |Vmn|<«<|En— Enl. This 
means that the probability that the energy in the final state will be 
modified appreciably from that in the initial state by a very weak 
constant perturbation is very small. One might ask: How can the 
energy be expected to change at all by the large amount Em — En 
as a result of the small disturbance Vma? The answer is that we have 
considered V to start suddenly at the time t = 0, and the definiteness 
of this time permits, by the uncertainty principle, a large uncertainty 
in the energy [cf. Eq. (5-19) and associated discussion]. 


Problem 6-20 Suppose V is turned on and off slowly. For 
example, let V(x,t) = V(x)f(), where f(t) is smooth, as shown in 


Fig. 6-12. 
f(D = Yer fort <0 
= 1 — We fro > <> 


= 1 10 لسع‎ )8-0 for > E E Ik 


Jae (n. TOT SRL (6-80) 


Suppose that 1/y, which is the time of rise of the function f(t), >> T, 
but suppose further that y < (Em — En). Show that the probability 
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given by Eq. (6-79) is reduced by a factor {y?/[y? + (Em — E,)?]}?. 
In this definition of f( we still have a discontinuity in the second 
derivative with respect to time. Smoother functions make still 
further reductions. 


If it should happen that Em and E, are exactly the same energy, we 
find P(n— m) = |Vinn|?T?/h?. This grows as the square of the time. 
It means that a concept of the transition probability per unit time is 
not meaningful in this case. This formula holds only for T short 
enough that VmaıT <f. It turns out that if only two states of exactly 
the same perturbed energy are involved, the probability of being 
found in the first goes as cos? (|Vm,|Z'/h) and of being found in the 
second as sin? (|Vmn|Z'/%), while our formula is only a first approxima- 
tion to this. 


Problem 6-21 Consider the special case that the perturbing 
potential V has no matrix elements except between the two levels 
1 and 2; and further, suppose these levels are degenerate, that is, 
suppose Kı = Hy. Let Vı = Voi = v and let Vii, Vor and all other 
Vn be 0. Show that 


vT? 0/04 vT 
يد‎ GD a aia اما‎ 
or „vT? TEGIN 
A12 = TAR + 07 6 = = —-1 emei (6-82) 


Problem 6-22 In Prob. 6-21 we have Viz = Vz, so that Vız is 
real. Show that even if Vız is complex, the physical results are the 
same (let v = |V12]). 


Such systems swing back and forth from one state to the other. A 
further conclusion can be drawn from this result. Suppose the per- 
turbation acts for an extremely long time so that VinnT/h >> 1. Then 
if the system is investigated at an arbitrary time T, which is somewhat 
indefinite, the probabilities of being in either the first or second state 
are, on the average, equal. That is, a small indefinite perturbation 
acting for a very long time between two states at the same energy 
makes these states have equal probability. This will be useful when 
we discuss the theory of statistical mechanics in Chap. 10. 

The case of great importance is that in which the values allowed 
for Em, the energy of the final state, are not separate and discrete 
but lie in a continuum, or at least are extremely closely spaced. Let 
us say that p(B) dE is the number of levels or states in the range of 
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energy E to E + dE. Then we can ask for the probability to go to 
some state in this continuum. First we see that to go to any state 
for which E, — Em is large is very unlikely. It is most likely that the 
final state will be one of nearly the same energy as the original En 
(within an error + Vma). The total chance to go into any state is 


È gp an ee 


» P(n— m) = 
m=1 
eaves 557 p(En)dEn (6-83) 

The quantity {4 sin? [(Em — En) T/2h]}/ (Em — En)? is very large if 
Em ~ En, reaching a maximum of 12/112, whereas it is much smaller if 
En and E, differ appreciably (relative to A/T), as shown in Fig. 6-13. 
Thus almost all the contribution to the integral on Em comes when 
Emn is in the neighborhood of the value En. 

If Vmn does not vary rapidly so that we can replace it by a typical 
value, and furthermore if p(L,,) likewise does not vary rapidly, then 
to a good approximation we can replace the integral of Eq. (6-83) 
by the expression 


En sin? [(Em — En) T/2h] 
UVa f °° Sarg dm 
Since ii al [(sin? x) /x?] dx = r, the integral of Eq. (6-84) has the value 
71/211 and we obtain the result that the probability for a transition to 


(6-84) 


sin? x 
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Fig. 6-13 In this figure the energy difference Em — Eq is replaced by the variable zx. 
When these two energies are approximately equal (thus z is very small) the function 
(sin? x) /x? approaches its maximum value. For large values of the difference the function 
becomes very small. Thus, in expressions involving this function, the most important 
contributions come from the central region, that is, the region where the two energies are 
approximately equal. 
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some state in the continuum is 
P(n — m) = 2a | V mnl? aon (6-85) 


and that the energy in the final state is the same as the energy in the 
original state. 

From these results we can write the probability of a transition per 
unit time in the form 


e = 2 |Mn—m|20(E) (6-86) 


where M,_,» is called the matrix element for the transition and p(B) 
is the density of levels in the final-state. In our case Mr, iS Vinn- 
If we went to a higher-order expansion of Nmn, it would be more com- 
plicated. Another way to write this expression is that the probability 
of a transition per unit time from state n to some particular state m is 


dP(n—m) _ 2n6(E, — Em)|M nml? 
dt 3 0 

Then when we sum over a group of states m, only those with En = Em 

survive. Since 0 — i; dEm p(Em), we get as a result Eq. (6-86). 


(6-87) 


We may illustrate Eq. (6-86) by an example which we have pre- 
viously discussed from a different point of view, namely, the scat- 
tering of an electron in a potential (cf. Sec. 6-4). Suppose an other- 
wise free particle has an interaction with a central potential V(r) 
and we wish to discuss the scattering of this particle from an initial 
state of a definite momentum to final state in a new direction with 
another definite momentum. We suppose that the state n, the 
initial state, is a plane wave of momentum py, so that the wave function 
gn is exp (tpi-r/h) (which is normalized so that the integral of the 
absolute square of ¢, over a unit volume is unity). Likewise, suppose 
the final state is a plane wave of momentum p» so that the wave func- 
tion $m is exp (¢p2-r/h). The matrix element Vmn is 


Vina = f " e-GMPet V(r) eM dêr 
= v(q) (6-88) 


where q = وم‎ — pı. In the scattering, the energy will be conserved so 
that p2?/2m = pı?/2m. This means that the magnitudes of momenta 
pi and وم‎ are the same. Let us set them equal to p, so that 


[p2| = |pi] = p 
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By our usual convention for writing differential elements of momen- 
tum, the number of states which have their momenta in the volume 
element of momentum space dp» is d*ps/(2rh)* = p? dp dQ/(2rh)', 
where dQ is the element of solid angle which contains the momentum 
vector .وم‎ An element dE of the energy range is connected to the 
element of momentum space by ` 


dE = P’ _ pap (6-89) 


Thus the density of momentum states for particles traveling into the 
solid angle dQ is 


)6-90( ارو امس افد p(y‏ 


Substituting these relations into Eq. (6-86), we find the probability of 
transition per second into the element of solid angle dQ is given by 


dP ILAE ' 
T= (ap) rebo! (6-91) 


We define an effective target area or cross section for scattering 
into dQ as do (cf. Sec. 6-4). The number of particles that will hit 
this area per second is the cross-sectional area times the velocity 
of the particles coming in, uı = pi/m. This is because we have started 
with wave functions ¢, which were normalized on a unit volume; or in 
other words, the relative probability is unity to find one particle in 
any unit volume. Thus 


“y d2 = u do = Pde (6-92) 


Therefore we obtain for the cross section the result 


d 
rive (e sm) CE (6-93) 
which is just what we obtained in Eq. (6-44). 


Problem 6-23 Show that the same result is obtained for 0/9 
even if the wave functions ¢, are normalized to give a unit probability 
for some arbitrary volume V. 


Problem 6-24 Suppose that the potential V is periodic in time. 
For example, suppose V(z,t) = V(x) (et + e*t). Show that the 
probability for a transition to take place is small unless the final state 
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is one of the two values (1) Etna = Einitiaı + fiw (corresponding to an 
absorption of energy) or (2) Etna = Einitiaa — Aw (corresponding to 
the emission of energy). This means that Eq. (6-86) is unchanged, 
but the density p(£) of states must be calculated at these new values 
of E. Or, in analogy with Eq. (6-87), we have 


[En — ħu) + 5(Em - En + hw)]‏ الل اعد = Sea‏ د 
)6-94( 


Problem 6-25 It has been argued that the equations of the elec- 
trodynamics must, like those of mechanics, be converted to a quantized 
form on the basis of the photoelectric effect. Here an electron of 
energy fiw is occasionally emitted from a thin layer of metal under 
the influence of light of frequency w. Is this impossible if matter 
obeys the quantum laws but light is still represented as a continuous 
wave? What arguments can you adduce for the necessity of giving 
up a classical description of electrodynamics, in view of the results of 
Prob. 6-24? 


Problem 6-26 Suppose we have two discrete energy levels Æ, 
and FE», neither of which is in the continuum. Let a transition be 
induced by a potential of the form V (x,t) = V(x)f(). Show that the 
probability of transition is 


P(1 ج‎ 2) = |Vi2|?|¢(wo)|? (6-95) 


if f(t) is representable by the Fourier transform 


i) = f swe S (6-96) 
and wo = (E - 


If f(t) is a statistically irregular function familiar from the theory 
of noise (called filtered white noise), the value of ¢(w) given by the 
inverse transform 


olo) = | prO at (6-97) 


depends on the range T of integration of the time ¢. If T is very large, 
|¢(wo) |3 can be shown to be proportional to T. Thus we get a transi- 
tion probability proportional to the time and the “intensity” or 
“power” (mean-square value of f per second) at frequency wo per unit 
frequency range. In virtue of this, the probability for the transition 
of an atom in a continuous spectrum of light is proportional to (1) 
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the exposure time and (2) the intensity of light at the frequency 
(E — Ey)/h of absorption. 


The Higher-order Terms. It is interesting to look at the second- 
order term in the perturbation expansion. This term is of special 
importance in the problems wheré Vma = 0 for those particular states 
mand n of interest. Let us suppose that we have such a problem, and 
suppose further that there are other states k كد‎ m for which Vim # 0. 
The first-order term is 0, and so long as n كد‎ m the zero-order term is 
likewise zero. Thus the lowest-order term which enters into the 
calculation of the transition amplitude is the second. 

Suppose that the potential V is independent of t. Then the second- 
order term in the transition element is Ama‘; and if T = ta — ti, we 
have from Eq. (6-74) 


et G/M) Emt Enty )9( = — D VEVA if dts A dts 
k 


x elt) (£y,—E,) tag (i/) (E,—En)ts 


Il 


D) 1 ع‎ eC By —Eta(e(i/t) EE — 1) 
k 


dts 
a a 


e 2 VineVien [eC En-EdT — 1 
E, — Rece Em = En 
elilt)(En-E)T — 1 


sige ere) CS 


The first of the two terms in the last factor of this result has the 
same time dependence as we have seen in our first-order result. There- 
fore if the other term is neglected for a moment, we see that the net 
result would again be to make transitions to states where Em = En, 
with a probability proportional to 7. The probability per unit time 
is of the same form as Eq. (6-86) but with M,_,. now given by 


Viet an 
MEET DIEE EE E, (6-99) 


If we assume that the states lie in a continuum, the sum becomes an 
integral. Equation (6-99) is correct in the circumstance that it is 
impossible to go by first-order transition, impossible not only to the 


. state m but also to any state k of the same energy as the initial state. 
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Under these circumstances Vz, = 0 for states such that EH, = En. 
Then the second term in Eq. (6-98) is never large; for it cannot be 
large unless En — E, is nearly zero, and then Vz, in the numerator is 
zero. All the effects come from the first term, and Eq. (6-99) is 
correct. Furthermore, in the sum over k in Eq. (6-98) there is no 
ambiguity at the pole where E, = Em; for the numerator vanishes 
at this same value of Ep. 

On the other hand, in some situations it may be true that a first- 
order transition is possible to some other continuum state (e.g., a 
nucleus may decay in more than one way). In such a case the sum in 
Eq. (6-99) is meaningless; for we must define what to do near the pole. 
It is the neglected second term in Eq. (6-98) which comes to our rescue 
here and shows that the correct expression for Mn4m (now including 
the first-order term also for generality) is 


Mae = Kan Sta 0 De Vnt Vin (6-100) 
k 


— En — te 


in the limit e— 0. How this comes about we shall now analyze. 
First we may notice that for large T we cannot get a large probabil- 
ity of transition (proportional to T, that is) unless E, and Em are 
practically equal (within about H/T). This is evident for the first 
term in Eq. (6-98). For the second term large amplitudes can arise 
only if E, ~ En; but if Em is not very close to En, the factor in front 
is a smooth function of E, for E, near Em. Taking it as nearly con- 
stant for a small range near Ep = Em, we see that the second term can 
be approximated by some constant times 
e@t/heT 1 


de 
€ 


where e = (Em — E+) is to be integrated over a small range, say — ô 


to +6. But 
è etlyeT — 1 £ 31/3. e — 1 a 6T/h [| COSY — 1 isin y 0 
fee € e = 7 < A ae y a y ) s 
(6-101) 


The first integral is that of an odd function and vanishes. The second 
approaches a finite limit as T— œ (and therefore as ôT/ň— œ). 
That is, 


ARN را‎ es 
2i f pa للا‎ 


Quantum mechanics and path integrals 


so no large transition probability occurs. A large effect can arise only 
in case E, and En are essentially equal, for then the double coincidence 
of the two poles from (E, — E,) ^ and (Em — E)! can make the 
second term important. Therefore, we continue the analysis, assum- 
ing Em and E, are nearly equal. 

The sum of k in Eq. (6-98) can be divided into two regions by 
choosing a very small energy A and breaking the sum up into a part A 
for which |E, — E,| >A and a part B for which |E, — E,| < A. 
We choose A to be small enough that the factor VingVin does not vary 
appreciably when E, varies around E, over this energy range 2A. 
This is some finite energy, and we shall take T so long that H/T < A, 
which means that |Z, — Em| < A. 

First for part A, |Z, — Em| >A. Then the second term cannot 
become large; for its poles are avoided. Only the first contributes, 
and the contribution is 


er is 
yh 
where z = (Em — E,)T/h and 
(A) 


DT 


The sum extends over all E, except for within +A of Em. This sum is 
nearly independent of A, and as A — 0 it is the definition of a principal- 
value integral. That is, in the limit A »ك‎ 0 we can write 


e 
Er — Ep 


a )6-102( 


GF > Paran P.P. (6-103) 
k 


where P.P. is the principal part and we have reinstated the first-order 
term, in case it does not vanish. 
For the region B we take VinzVin to be constant at its value for 
(B) 


Hx — En = 0. That is, we replace 2 Vink Vien (Ez) by 
k 
Em+A 
[> Var Vand (Ey — En) | fe, F(E) dEr (6-104) 
k 


We write this as bJ, where 


b = 2 VineV ind(Ey — Em) (6-105) 
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Now we put (Em — E,)(T/h) = x and (Es — E,)(T/h) = y, so that 
(En — E,)(T/h) = x — y, to get 


AT/h dy ez — 1 ER | 
Th an AT/à 4 = Ey ) (6-107) 


This integral is most easily evaluated by contour integration, 
imagining y as a complex variable and changing the contour. Instead 
of integrating on the straight line from - 41/7 to 41/7, we go on the 
semicircle of radius 47/7 below the real axis. Since 417/7 is very 
large, the second term contributes negligibly; and since 


eee dy 
AT/h Y eee 


on this contour, we get J = ir(T/h)(e* ع‎ 1)/x. Putting the A and 
B parts together, we get 


(a + irb) cone 


(6-108) 
for the amplitude. This gives a probability for transition of the form 
Eq. (6-86) with 


AE en IG zE irb = KA sie 0 Vier Vis bye or = in(Er A Bn) | 
7 n 
)6-109( 


The last bracket can be written (Es — Em — 2-4 in the limit as 
جع‎ 0, as we have written in Eq. (6-100). 

From Eq. (6-100) we learn then that even if no direct transition 
is possible from n — m, nevertheless the transition can occur, as we 
say, through a virtual state. That is, we can imagine that the system 
goes from n to k, then k to m. The amplitude for an indirect transi- 
tion process is given by Eq. (6-99). We note that it is not right to say 
that it actually goes through one or another intermediate state k, 
but rather that in characteristic quantum-mechanical fashion there 
is a certain amplitude to go via the various intermediate states k, and 
the contributions interfere. 

The intermediate states are not of the same energy as the initial 
and final states. The conservation of energy is not violated, for the 
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virtual state is not permanently occupied. The strength of contribu- 
tion to the sum varies inversely with this energy discrepancy. 

There is nothing absolute about these intermediate states. They 
come from considering V as a perturbation to a system H and from 
speaking about the true states of H + V in terms of those of H alone. 
If other separations are made as to what is the “unperturbed” problem 
and what is the “perturbation,” different formulas and intermediate 
states will arise in the description. 

When the potential depends upon time (e.g., periodically), many 
interesting effects result. Most of these have been observed in micro- 
wave experiments, where the perturbation V(z,t) is a weak electric 
or magnetic field with a periodic variation in time. 


Problem 6-27 Derive the perturbation expansion up through 
the terms of the second order for potentials periodic in time. 


Sometimes a transition cannot take place except by the use of two 
or more intermediate virtual states. Analysis of such transitions 
requires the calculation of third- and higher-order terms in the pertur- 
bation expansion. 


Problem 6-28 Show that when a transition is impossible either 
directly or through a single intermediate state, but requires the use 
of two intermediate states, it is determined by the matrix element 
given by 


V meV x V n 
Mim =) E = EE BD a) 


This corresponds to the third-order term in the perturbation expansion. 


Problem 6-29 Suppose two perturbations are acting, V(z,t) and 
U(x,t), for example, a combination of d-c and a-c electric fields or a 
combination of electric and magnetic fields. Suppose further that a 
certain transition cannot occur with either V or U alone, but can 
occur only when both act together. Under the special assumption 
that both V and U are constant in time, show that the matrix element 
determining the transition element is given by 


ee SF Unk V kn 
Manon = em ey aaa (6-111) 
Next, suppose both potentials are periodic in time but have different 
frequencies, wı and we. What then is the matrix element? 
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Calculation of the Change in Energy of the State. In com- 
puting transition amplitudes we have considered only those states 
n Æ m. Suppose we turn our attention to the term m = n. Con- 
sidering the zero- and first-order terms in the perturbation expansion, 
we have 


ifr 
Ann = 1—'5 if Vinm(t) dt (6-112) 


If V is constant in time, this gives 1 — (¢/h)VmmT. What is the 
meaning of this result? As a consequence of the introduction of 
the additional potential V into the original hamiltonian we can 
expect the energies of all of the states of the system to be slightly 
altered. We can write the new energy of the state m as Em + AEm. 
The time-dependent portion of the wave function describing this 
state will be exp [(—7/h) )2 + AE,,)t] instead of the previous function 
exp (—1/h) Ent. 

Over the period of time T during which the perturbing potential acts 
this relative difference in phase introduces the factor 


exp [(—7/h) AE, T] 


Expanding this factor to first order in time gives 1 — (¢/h) AE, T. 
Thus we see that a first-order calculation of the energy shift in a state 
m due to a perturbation V is 


AEn = Vinm (6-113) 


This derivation of the first-order energy shift is not satisfactory 
if the system is degenerate, i.e., if there are initially very many states 
of exactly the same energy. It turns out that in such a case terms of 
second order in V give equally large effects. 

Adding in the second-order term in the perturbation expansion for 
the transition element gives 


2 
7 Komt 


F (G) 0 ie He ad?) (E, Ea) (tots) dis 014 V mk Vim 
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(6-114) 


e UM EnT) am = 1 — 


For the present let us assume that there is no degeneracy. Consider 
first the term k = m in the series which is the second-order term. The 
integral over this particular term is just T?/2. Integrals for the 
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terms k = m can also be performed easily to give the result 

2 — 1 2m2 

0 ape Vee T 

2 il View|? T 1 — exp [—77(Ei — للكت‎ 
» 02 (t/h) (Er — Em) 


(6-115) 


The first three terms on the right-hand side of this equation repre- 
sent an expansion through second order of exp (—tVmmT7'/h). The 
first of the summation terms, the one proportional to T, can be inter- 
preted as a second-order energy change. That is, the incremental 
energy is not just Vinm, but contains higher-order corrections. Writing 
out the energy correction through second order in the perturbation 
energy, we get 


Pd ae 0 ع اسل‎ )6-116( 


This last equation gives the correct expression, through second order, 
for the shift in energy of nondegenerate states. This result is much 
more easily obtained by conventional methods, i.e., by finding solu- 
tions of 


(H + V)ọ = Ep (6-117) 


Furthermore, the conventional approach based on Eq. (6-117) permits 
simpler handling of degenerate states. However, it has been our pur- 
pose here to give an example of the use of transition amplitudes, rather 
than to give the simplest formulas for the computation of energy shifts. 

Actually, there are more complex problems involving energy incre- 
ments in which the method of transition amplitudes is the simplest to 
apply. In such applications the scheme, as we have attempted to show 
above, is to identify terms in a series proportional to T, T?, ete. Then, 
if we remember that the amplitude to stay in the initial state is pro- 
portional to exp (—7 AE T/R) and that the series expansion is equiv- 
alent to a series expansion of this exponential, the correct expression 
for AE can be written down. 

We have still not discussed the last term in Eq. (6-115). If the 
states E, lie in a continuum, we must also define the character of the 
reciprocal in the sum of Eq. (6-116). If we take it to mean the prin- 
cipal value, just as we found when analyzing the problem in second 
order for n عد‎ m, this extra term can be shown to produce an effect 
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proportional to T and to lead to an additional correction to Eq. (6-116) 
of 
AEn جاح‎ 2 5(Em = Er) Vina tun (6-118) 
k 


But this cannot represent a further correction to the energy for it is 
purely imaginary, and the energy must be real. Let us call it —7y/2 
(the PR is for convenience later) and write 


SETAA 1 O 
AEn — كد‎ = Va a mes ET و‎ T (6-119) 


This implies that the transition amplitude Amm to be in the mth 
state after a long time is proportional to 


exp | -: (a2 — 2) r | = exp [—i(AEm)T] exp 6 ف‎ 


The first factor is the energy shift. The second is easily interpreted; 
for the probability to be in state m after time T is |Amm|? = exp (—YT). 
It falls with time because at each instant there is a probability that 
a transition is made from m to some other state. That is, if all is 
consistent, y must be the total probability per second of a transition 
from m to any state in the continuum of the same energy. This it is, 
because from Eq. (6-118) our y is 


Y= » 2r (Em z Er) |V m|? (6-120) 
k 


So we see that the total probability per second is just the sum of 
Eq. (6-87) over all possible final states as required (i.e., up to the 
required order in V). 

The reciprocal of y is called the mean lifetime of the state. Strictly 
speaking, a state with a finite lifetime has no definite energy; the 
energy uncertainty by the Heisenberg relation is i/lifetime, or y. 

If resonance experiments are performed to find the energy difference 
of two levels, each of which has a decay rate y, the resonance is not 
sharp but has a definite shape. The center of the resonance determines 
the energy difference, and the width of the resonance gives the sum 
of the y’s of each level. 
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Transition 
Elements 


In the preceding chapter we developed the concept of a perturbation 


` treatment for changes of state in a quantum-mechanical system. 


We carried out an investigation of this method as it is applied to 
systems whose unperturbed hamiltonians are constant in time. In 
this chapter we shall continue the development of the perturbation 
concept and generalize the treatment to cover systems where the 
unperturbed state may have a hamiltonian varying with time. We 
shall introduce a more general type of notation and attempt to broaden 
and deepen our understanding of the ways in which changes of state 
take place in a quantum-mechanical system. The notation to be 
introduced applies to a type of function which will be defined in the 
first portion of this chapter. The function is called a transition 
element. 

The chapter is divided into four parts. The first part, comprising 
Sec. 7-1, gives a definition of transition amplitudes and transition 
elements, with the help of examples based upon the perturbation 
theory of Chap. 6. The second part, comprising Secs. 7-2 to 7-4, 
gives some interesting general relations among transition elements. 
The third part, consisting of Sec. 7-5, shows the connection between 
transition elements defined with the help of path integrals and the 
treatment of quantum-mechanical transitions defined in terms of the 
more usual operator notation of quantum mechanics. In the last 
part, comprising Secs. 7-6 and 7-7, the results learned in the preceding 
sections are applied to two interesting problems of quantum mechanics. 


DEFINITION OF THE TRANSITION ELEMENT 


The development of a quantum-mechanical system with time can be 
pictured as follows. At an initial time tı the state is described by the 
wave function ¥(2,t1:). At a later time t this original state will 
develop into the state $(z2,t2). 

At this later time suppose we ask the question: What is the prob- 
ability of finding the system in the specific state x(2,t2)? We know 
from the general principles developed in Chap. 5 that the probability 
of finding the system in this specified state is proportional to the 
square of the amplitude defined by 


Sx*(x2,t2) © (واروته)‎ dre 


We also know from Chap. 3 that the function ¢ can be expressed 
in terms of the original wave function with the help of the kernel K 
describing the propagation of the system between the times tı and ts. 
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Thus, in determining the probability of finding the system in a specified 
state we can start with the original wave function y and bridge the 
time gap with the propagation kernel K(2,1). 

The resulting amplitude, whose absolute square gives the probability 
desired, we shall call the transition amplitude, and we shall write it in 
the following notation: 


(xllly) = SSx*(@2)K(2,1)o (a1) dae day (7-1) 


We wish to return to an even more basic description of the transition 
phenomena, and we reintroduce the action S describing the behavior 
of the system between two time limits. Thus we write the transition 
amplitude as 


(xls = fff x dear) Del) dz dzs (7-2) 


Here we have made the notation a bit more explicit by attaching sub- 
script S to the transition amplitude to indicate the action for which 
the integral was calculated. The path integral is to be taken over 
all paths which go from zı to zz, and the result of this path integral 
is multiplied by the two wave functions, then integrated over the space 
variables at the two limits. 

Before proceeding further, we shall define the notation more com- 
pletely to cover a more general situation. We introduce the func- 
tional F[x(t)] without (for the present) describing its physical nature. 
With this functional we define a transition element as follows: 


(xlFlW)s = Sf fx (za Flx@]e's!"y (a1) Da(t) dz1 dzz (7-3) 


Here F is any functional of x(t) which does not involve x(t) at the 
end points x; or ونه‎ or beyond the end points. In the special case that 
F = 1, the integral of Eq. (7-3) defines a transition amplitude. 

It is difficult to understand transition elements on the level of 
intuitive physics. One approach toward such understanding involves 
a classical analogy. Picture a small particle moving with brownian 
motion. At some initial time, t = tı, the particle is at .ننه‎ We wish 
to determine the probability that the particle arrives at the point ونه‎ 
at the time t = t2. For quantum-mechanical particles, we talk about 
starting from an initial state and arrive at some final state. Thus, the 
point zı for the brownian particle is analogous to the initial wave func- 
tion Y(z) in Eq. (7-2), and the point xz to x(r2). Furthermore, the 
solution of the quantum-mechanical problem requires integration 
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over the variables x; and xz» of the initial and final states—a step 
unnecessary in our classical problem. 

We would solve the classical problem by considering all possible 
paths for the particle’s motion, We would weight each path with the 
function defining the probability that the particle actually follows 
such a path and then integrate the weighted contributions for all such 
paths. The weighting function is analogous to the term 205/2 appear- 
ing in the integral of Eq. (7-2). 

The final position in such a problem would be not a single point, 
but rather a small region, xz to z2 + dz. The result, when properly 
normalized, would be the distribution function P(x.) giving the rela- 
tive probability of arriving in the (differential) vicinity of za. This 
function is analogous to the transition amplitude of Eq. (7-2) in the 
case that y and x are ô functions of position. 

Now suppose we wish to know more about the motion than simply 
the relative probability to arrive at za. For example, we may wish to 
find the acceleration experienced by the particle at some particular 
time, say 1 sec after it starts. But now we need the weighted aver- 
age of the acceleration, i.e., the acceleration for each possible path 
with each path weighted by the function defining the probability of 
the path. Such a weighted average is analogous to the transition 
element of Eq. (7-3). The property of interest, such as the accelera- 
tion at some time t, replaces the function F[zx(t)] in the integral of 
Eq. (7-3). The classical problem could be solved by a path integral 
very similar in form to that of Eq. (7-3). 

In the remainder of this chapter we shall make use of this analogy, 
and we shall occasionally refer to transition elements as “weighted 
averages.” However, it must be kept in mind that the weighting 
function in quantum mechanics is a complex function. Thus the 
result is not an ‘‘average”’ in the ordinary sense. 

The path integral method of solving brownian-motion problems as 
described in this classical analogy is actually a very powerful method. 
It will be developed in detail in Chap. 12. For now, we attempt to 
further clarify the notion of a transition element with the help of the 
perturbation theory developed in Chap. 6. 


Perturbations. Suppose the action describing the development 
of the system can be separated into two parts, so that S = So +o. 
We suppose that the first part So leads to simple path integrals, whereas 
the remaining part ڍ‎ is small enough that we can apply a perturbation 
scheme. We write the exponential function of Eq. (7-2) as 


gilh = eiSolħeicIh (7-4) 
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Using Eq. (7-3), the transition element of Eq. (7-2) becomes 
lls = (xle"|p)s, (7-5) 


The exponential function can be expanded to give 
; 1 
(xlLP)s60 = (xls, + 5 (xlol)s, — 973 (xlas, + ° ° ° )7-6( 


This expansion is a generalized version of Eq. (6-3) and forms the 
basis of the perturbation theory. The transition elements which arise 
in most quantum-mechanical problems result from this expansion. 

Suppose the perturbation action ø results from a perturbation 
potential, so that 


o = JV[x(é),¢] dt (7-7) 
Then the first-order perturbation is given by the transition element 


(xlol)s. = Sxl Vie), ts. dt (7-8) 


To evaluate this element, we need to solve the integral 
VEO = fff “كو زومر‎ iyl) dar dea Dz) (7-9) 


The first step in the solution of this integral is the same as the 
solution for the perturbation kernel K® described in Eqs. (6-8) to 
(6-11). This solution for the path integral is followed by integration 
over both end points, x; and z2, as well as an integral over the midpoint 
x3 [identified as the point c in Eq. (6-10)]. That is, 


(xl ViA, is, = Sx*(e2)Ko(2,3)V(3)Ko(3,1)¥ (a1) dai dra dzs (7-10) 


We have now arrived at an expression which combines three concepts 
previously introduced. First, we have made use of the propagation 
rule for a wave function as defined in Eq. (3-42). Next, we have made 
use of the amplitude function as defined in Eq. (5-31), which gives the 
amplitude that a system known to be in one state will be found in 
another state. Lastly, we have made use of the first-order perturba- 
tion theory given in Eq. (6-11) for the kernel describing the propaga- 
tion in time. All of these ideas combined give the transition element 
of Eq. (7-10). The absolute square of this element is the probability 
that a system starting in state y and acted upon by the small potential 
V (x,t) will be found at a later time in the state x (if state x would not 
be reached for V = 0, that is, if (x|1|y) = 0). 
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We can use Eq. (3-42) to shorten our notation, in the same way 
that the notation of Eq. (6-23) was shortened into the form of Eq. 
(6-25). We defined function ¥(23,ts) as 


¥(3) = JKo(3,1)¥(a1) dai (7-11) 


which is the wave function that would result at the time ts from the 
initial wave function if there were no perturbation acting. In a sim- 
ilar way we define 


x*(xs,ts) = Jx*(r2)K0(2,3) dre | (7-12) 


as the complex conjugate of the wave function which at ts would lead 
to the function x(x) at the time ts if there were no perturbation acting. 
[See Eq. (4-38) and the following discussion, including Prob. 4-7.] 

In terms of these new wave functions, the first-order term in the 
perturbation expansion can be simplified to read 


(xlJVIx(),t] dilys, = SJx*(3)V (3)¥(3) dzs dts (7-13) 


We see here that the transition amplitude written in this form is a 
generalization of the transition amplitude Amn which was introduced 
in Sec. 6-5. If the wave functions on the right-hand side of Eq. (7-13) 
are eigenfunctions, then the resulting transition amplitude is identical 
with Amn“, as defined by Eq. (6-70). 

Thus the evaluation of a transition element of a functional F[zx(d)], 
which depends only on x at a particular time ¢ [that is, an ordinary 
function of x(t)], or of a time integral of such a functional presents no 
problem. The evaluation of a transition element for functionals 
involving the values of x at two separate times is also easy. This 
occurs, for example, in the second-order perturbation term. This 
can be written as 


ya (lols, = aos [| KAVON), lly) at ds (7-14) 


The integrand of this last equation is itself a transition element, and 
it is written as 


(x|V[a(),4)V[x(s),s]l¥) = SSx*(4)V (4) Ko(4,3)V(3)¥(8) dzs dza (7-15) 


where we have substituted tj = s and 4 = t if s < t or t = t and 
USB SD 
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Thus the second-order term in the perturbation expansion becomes 
1 
sa (x || VEON at f Vix@),s] dsv) 


- lf x” (4) V (4)Ko(4,3)4 (3) dxs dts يندك‎ dta (7-16) 


This can be recognized as a generalization of the transition amplitude 
defined in Eq. (6-74). Expressions involving three or more functions 
are also readily written down. 

Equation (7-4) corresponds also to a more general type of pertur- 
bation theory. For example, consider the case of the particle inter- 
acting with an oscillator. After integrals have been carried out over 
the coordinates describing the oscillator, the resulting action can be 
written as So + o, where (see Sec. 3-10) 


11 RT y oT he ibs glx (t) t]g[x(s),s] sin w(te — t) sin w(s — tı) ds dt 
(7-17) 


with g[zx(t),t] characterizing the interaction of the particle and oscil- 
lator, and T = t — tı. 

We have noted that path integrals involving such complicated 
actions are very hard to evaluate indeed; but if the effect of the compli- 
cated term ø is expected to be small, we can obtain useful results with 
less effort with the help of the perturbation expansion of Eq. (7-4). 
To illustrate, we find the first-order term in such an expansion (i.e., 
the first Born approximation). Using Eq. (7-17) for c, we must evalu- 
ate the term Si dias This term can be written as 


 (xlolY)s, = ° fale O gles), Is, sin olte — 2 


X sin w(s — tı) ds dti (7-18) 


= sin wT 


so that the difficult part of the problem is reduced to finding 


(xlglz(t) „tlgle(s),s] 007 
But this we have already done in Eq. (7-15), except that g replaces 7 
Therefore, we write 


(xlglx(@), tlalx(s),s]l¥)s, = JIx*(4) gla (ta) ,ta]K0(4,3) glx (ts) لز[وار‎ )8( dzs dara 
(7-19) 


This expression can be substituted into Eq. (7-18) to obtain the final 
result for the first Born approximation, (7/h)(x\clY)s,. 
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Transition elements will come up more frequently in succeeding 
chapters. In each example they can be evaluated in the straightfor- 
ward manner which we have illustrated here. For that reason, very 
little of the material in the remainder of this chapter is really essential 
to the work that follows. Nevertheless, there are two reasons for the 
inclusion of this material in this book. First, it is possible to obtain 
a very general relation between transition elements. This relation 
might well serve as an alternative starting point for the definition of 
quantum mechanics. Second, for many people already familiar with 
the more conventional operator notation of quantum mechanics, it is 
helpful to have examples of the translation from the more customary 
representation into that which is used in this book, such as expressions 
of the form of Eq. (7-3). 

With the rules for translation available, the subject matter of the 
later chapters, developed as it is from the path integral approach, can 
be appreciated in terms of more familiar symbolic concepts. 

The relations discussed in the remainder of this chapter are inde- 
pendent of the form of the wave functions which describe either the 
initial or final state of the system, and they are used in defining the 
integral of the transition element. For this reason we shall abbreviate 
our notation by omitting any specific reference to these wave functions. 
Thus a transition element will be written as (F}s instead of (x|F|y}s. 


7-2 FUNCTIONAL DERIVATIVES 


We are embarking on a mathematical development which leads to 
an interesting relation between transition elements. This relation 
finds its most elegant expression in terms of a mathematical idea, the 
functional derivative. Since this idea may not be familiar, we describe 
it in this section. 

The functional F[z(t)] gives a number for each function x(t) that 
we may choose. We may ask: How much does this number change if 
we make a very small change in the argument function z(t)? Thus, 
for small n(¢), how much is F[a(t) + n(t)] — F[z(t)]? The effect to first 
order in 7 (assuming it exists, etc.) is some linear expression in 7, 
say, JK(s)n(s) ds. Then K(s) is called the functional derivative of F 
with respect to variation of the function z(t) at s. It is written 
6F/6x(s). That is, to first order 


Fle + n] = Fle] + f O A ب‎ ` (7-20) 
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This ôF/ôx(s) depends on the function x(t), of course, and also on the 
value of s. Thus it is a functional of z(t) and a function of time s. 
We may look at it another way. Suppose time is divided into 
very many steps of small interval ¢, the values of the time being 
ti (tı = e +t). The function z(t) can now be specified approxi- 
mately by giving the value x; that it takes on at each of the times &. 
The functional F[x(t)] now is a number depending on all the 2z;; that is, 
it becomes an ordinary function of the variables i, 
BILOL MC TEAS (7-21) 
Now we can consider its derivative with respect to one of these vari- 
ables ,زه‎ 0F/dx;. Our functional derivative is just this partial, divided 
by ء‎ and taken at the point t: = s. That is, 


ôF 1 oF 
5x(s) e Oz: E 


This we can see as follows. If we alter the path from z(t) to x(t) + 
n(t), we change all the x; from z; to x; + n; [where n; = n(t;)],80 that the 
first-order change in our function is 


F(. Te Li + Ni, Veer F Nipi eae 9 — 7 se gli Litt) o ° ) = 

(7-23) 
from the ordinary rules of partial differentiation. If now we call 
(1/6) (F /ðx:) = K;, the last sum is 2 Kine, which in the limit becomes 


fK(i)n(@ dt. Soif this limit exists as e» 0, then it is equal to ôF/ôx(s). 
One can also use the ideas of differentials. Just as we can write 


ate 0 of 0 


so we can write for the first variation of any functional 


ôF 

a n 7-24 
ôF 1 376) 6x(s) ds ( ) 
where ôz(s) is the differential change in path at x(s). 


PORT ee i ^ L(y) dt, show, for any s inside the 
range tı to t2, 

5S d (aL aL 
| دده‎ ee et) 7-25 
ôx (s) ds (5) + Ox 5 


where the partial derivatives are evaluated at t = s. 
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Problem 7-2 If F[x] = x(r), show that 


oF 


Ege- (7-26) 


Problem 7-3 Show that ôF/ôj(r,s), where 


ia exp [4 f ع‎ Sj (وارة ؟) [(دابد؟)‎ R(rı ae are i = te) dri drs dtı dta] 
is given by 

oF = za D Na lel a. / 7 i = 
wan =l | RC r’, s — ijt) dr ar | F (7-27) 


Note that the function J(r,t) is a function of the four variables (z,y,z,t). 
Thus the single coordinate s, as used in Eq. (7-14), for example, must 
be replaced by the set of coordinates (zx,y,z,t) in specifying the point 
at which the functional derivative is evaluated. 


The general relation between functionals which we mentioned at 
the beginning of the preceding section may be obtained by trying to 
develop a formula for the transition element of 6F/éz(s). This we 
can do most easily in this way. Consider 


(F)s = f F[z(t)]e MSI Dz(t) (7-28) 


Now in the integral over paths substitute z(t) + n(t) for the variable 
x(t). For fixed n(t), D[x(t) + n(é)] = Dz(t) [since d(x; + n) = dai]. 
But the integral is unchanged by a substitution of its variable. Hence 


(PF), = j! F[x(t) + n(t)Je4St2©+901 Da () 
= | Feos De) + | | i na” as| el S121 Der(t) 


+ أ‎ Fl) | f ويج‎ no do ممصت‎ oe + < 
(7-29) 


expanding the exponential and displaying only to first order. The 
zero-order term is exactly (F)s again, so the remaining terms must all 
vanish. In particular, the first-order term must vanish for any 7(s), 
so that we conclude the relation 


/ oF \ _ م‎ ١ 
(EO = Tr eae (7-30) 


This general relation has many important consequences. 


Transition elements 


173 


It would be possible to use Eq. (7-30) as a starting point to define 
the laws of quantum mechanics. One could work backward to 
reproduce, for example, Eq. (7-6). If some generalization of quantum 
mechanics is desired, one might suppose such a generalization is 
included in the action S appearing in the term 65/3, or perhaps start 
with a form like Eq. (7-30) and introduce modifications with the help 
of the differential notation. Julian Schwinger has been investigating 
the formulation of quantum mechanics suggested by Eq. (7-30). 

We can see how the relation of Eq. (7-30) comes about in another 
way by imagining our time split into intervals e and functionals 
replaced by functions of the points x; corresponding to t; Then con- 
sider the path integral 


ji oF eM SOl Dar (t) (7-31) 
02 
where t, is some intermediate time not at either end point. The path 


integral is simply an integral over all the points z;. So we integrate 
by parts to get 


OF |. i 05: 

O el) Siz] MEM LP gih Slz()] z 
f See Dar(t) sfe n° Da(t) (7-32) 
dropping the integrated part. 

Problem 7-4 Discuss why the integrated part vanishes. 


The result is 


pore. tJ, OS\ : 
Fol Aas E (7-33) 


which is the same content as Eq. (7-30). 
It is better to write these relations as differentials, 


(3F)s = — $ (F 88)s (7-34) 


for then the specific variables on which F and S depend need not be 
indicated. 


Problem 7-5 Argue that Eq. (7-34) may be misleading, for Eq. 
(7-33) applies only to rectangular coordinates. Do this by studying 
the corresponding relation where spherical coordinates, for example, 
are used and we wish to find (dF /dr;)s. 
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7-3 TRANSITION ELEMENTS OF SOME SPECIAL FUNCTIONALS 


The relation of Eq. (7-34) has many interesting implications. In 
this section we shall investigate some of them. We shall take the 
special case of a one-dimensional particle moving in a potential 


V[x(t)]. 
Suppose the action over the path of the particle is given by 
s= ff E — view} a | (7-35) 


Upon e of the small variation 6x(t) to each path there results 
(to first order) 


of a if * [mê + 17” )([ dx(t) dt (7-36) 
Using Eq. (7-34), we have 


(Gaye : (r 1 ^ [mê + V'(a) loc) dt) (7-37) 


Alternatively, we could return to the point of view used in developing 
Eq. (7-33). That is, we imagine time divided into small slices of 
length e. In this case the action S can be written as 


21-1 


S = d [m soa = | (7-38) 


i 


If we select a particular time t, and, as before, let x, be the associated 
position of a path, then 


ja, wm (BS te) ب‎ Vase (7-39) 


Ot, FE 


Upon application of Eq. (7-33) there results 
or } — 227, cy 
(2) ete 2F [m (BeBe te) ie e |) ang 


€ 


In this last expression the factor involving an e in the denominator 
is actually the acceleration û evaluated at the time t. Thus Eq. (7-40) 


_is just a special example of Eq. (7-37). In particular, it corresponds 


to Eq. (7-37) if da(é) is zero for allt كد‎ t If 6x(t) is assigned the value 

٠ êz, (t — t), then Eq. (7-40) results. Actually Eq. (7- ~40), since 
it is true for all k, is completely equivalent to Eq. 2 in a more 
detailed notation, 
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In Eq. (7-37) suppose we choose the special function F = 1. Then 
ôF = 0 and we have 


= 1 f [më + V’(x)] 8(0 9 - 0 )7-41( 
Since this result must hold for any arbitrary choice of 6x(t), it must be 
that 

(mi) = —(V"(2)) (7-42) 


at all values of time. This is the quantum-mechanical analogue of 
Newton’s law. Making use of the classical analogue for a transition 
element, described in Sec. 7-1, this result says that the weighted 
“average” of the mass times acceleration at any time “averaged” over 
all paths with the weight e*‘/* is equal to the weighted “average” 
of the force (negative gradient of the potential) at the same time. 

As another example, suppose F is some arbitrary nonzero functional 
of all position variables except x,. Then the left-hand side of Eq. (7-40) 
is zero and there results (since dF /dz, = 0) 


(Fests, . . . gUk—1,Uk+4+1,) ٠ ° ٠ tN) r tm + e |) 
= 0 (7-43) 


This equation says that the transition element of më + V’(z), averaged 
over all paths, is zero at tą even if these paths are weighted with an 
arbitrary functional, so long as the functional is independent of the 
position of the path at the time & of interest. 

Suppose, however, the functional does depend upon the position 
of the path at the time of interest. In particular, suppose simply 
that the functional F is .ينه‎ Applying Eq. (7-40), we have 


E £ (nas E E O zV’) 
3 

= ; (mex (e= le AL = =) + ex V” )0م‎ (7-44) 

If we suppose that the potential V is a smooth function, then in the 


limit as e» 0 we find that ex,V'(x,) becomes negligible in comparison 
with the remaining terms. The result is 


Gee re) = (zm n 2 a) (7-45) 


This last equation involves the product of position variables x and 
momentum variables ma. In the first term the momentum is evalu- 
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ated first as a linear average corresponding to the time t, + ¢/2, and 
the position is taken at tą. In the second term the position is again 
taken at t but the momentum corresponds to the time tp — €/2. 
Thus this equation says that the transition element of a product of 
position and momentum depends upon the order in time of these two 
quantities. ; 

Later on, when we make a translation into the more usual operator 
notion, we shall see that both the operator equation of motion, cor- 
responding to Eq. (7-42), and the operator commutation laws of 
Eq. (7-45) have been derived from the same fundamental relation, 
Eq. (7-34). 

We can derive a further result from Eq. (7-45) which will give us a 
better idea of the characteristics of the paths which are important in 
quantum mechanics. Consider the two terms 


(ym E > a (7-46) 
and 
ae ee (7-47) 


These two terms differ from each other only in order e, since they are 
the same quantity calculated at two times differing by the interval e. 
Thus we are justified in substituting Eq. (7-47) for the second term 
in Eq. (7-45). The result is 


(mB ® (xy — aD) = È) (7-48) 


Alternatively, we can write this as 


)7-49( كت )=( 


€ ime 


This equation says that the transition element of the square of the 
velocity is of the order 1/e, and thus becomes infinite as e approaches 
zero. ‘This result implies that the important paths for a quantum- 
mechanical particle are not those which have a definite slope (or 
velocity) everywhere, but are instead quite irregular on a very fine 
scale, as indicated by the sketch of Fig. 7-1. In fact, these irregulari- 
ties are such that the “average” square velocity does not exist, where 
we have used the classical analogue in referring to an “average.” 
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If some average velocity is defined for a short time interval At, as, 
for example, [x(t + At) — x(t)]/At, the “mean” square value of this is 
—h/(im At). That is, the “mean” square value of a velocity averaged 
over a short time interval is finite, but its value becomes larger as the 
interval becomes shorter. 

It appears that quantum-mechanical paths are very irregular. 
However, these irregularities average out over a reasonable length of 
time to produce a reasonable drift, or “average” velocity, although 
for short intervals of time the “average” value of the velocity is very 
high. 


(xb, 0) 


(xa: Ta) 


x 


Fig. 7-1 Typical paths of a quantum-mechanical particle are highly irregular on a 
fine scale, as shown in the sketch. Thus, although a mean velocity can be defined, no 
mean-square velocity exists at any point. In other words, the paths are nondifferentiable. 
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Problem 7-6 Show, for a particle moving in three-dimensional 
space 2, Y, 2, 


h 
(zer — tx)?) = ((Yr+i — Ye)?) = ((2n41 — &)?) = — a (7-50) 
(wees — te) (Yeri = Ye)) = ((Zer1 — e) kr — 2) 
= ((yet1 — Yr) (Ze+1 — 2)) = 0 (7-51) 
It will not do to write the transition element of the kinetic energy 
simply as 
3 (r eS re oT (7-52) 


for this quantity becomes infinite as « approaches zero, How shall 
we find an appropriate expression to represent the kinetic energy? 
We might make the heuristic guess that only those functionals F 
which might appear in some kind of a physical perturbation problem 
may be of importance. How can we get the kinetic energy by a 
perturbation? If the mass of the particle were perturbed by a 
factor 1 + n (with n very small) for some short interval of time At, 
the action would be perturbed by n At(m/2)%?, which is proportional 
to the kinetic energy. We are led to ask: What would be the form 
of the first-order perturbation (c)s, if m were changed to m(1 + n) 
for a short time? 

For simplicity we can take the short time to be just e, the step 
used to define the time spacing, so that the first-order term divided 
by en is the kinetic energy. The perturbation in S of Eq. (7-38) (if 
the m in the 2 = k term is changed to m + nm) is clearly en(m/2) (we+41 
— (2/62. But this is not the only change in the path integral if m 
changes. The normalization factors A for each m vary as m+”, so a 
factor (1 + 147) is introduced from this. Hence the entire first-order 
change in the path integral when m is so changed becomes, after 
dividing by ne, 


: 0 كه‎ Tr)? it 7 ) (7-52) 


which should be satisfactory for 2/7 times the kinetic energy. 

Using Eq. (7-49), one might expect this to vanish; but Eq. (7-49) 
is valid only as e— 0 to the order 1/e. The quantity in Eq. (7-58) is, 
in fact, finite as e» 0. The expression can be rewritten by expanding 
the quadratic term. In Eq. (7-40) let F be .ينه — يبيد‎ If terms of 
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lowest order in e are kept, the result is 


( 0 (= = =) 6 = St) - (5 Ga 38 2y pre +5- 7)) )7-54( 


Thus we can define the left-hand side of Eq. (7-54) as the transition 
element of the kinetic energy. 

We see from this result that the easiest way to produce satisfactory 
transition elements involving powers of the velocities is to replace 
these powers by a product of velocities, each factor of which is taken 
at a slightly different time. 

In simple problems the transition elements can sometimes be evalu- 
ated directly. For such problems the same results can also be obtained 
by using the relations among transition elements which we derived in 
Sec. 7-2. These relations may supply us with soluble differential 
equations for the transition elements. We shall give a few illustra- 
tions, but it will be readily seen that the examples for which the 
method works must be so simple that a direct evaluation would not 
really be much more difficult. 

For our first example, consider the case of a free particle going 
from 2; to xə in the total time interval T. Let us find the transition 
element of the position at the time t, that is, x(t). Of course, this is 
some function of ¢ and it is clear that 


(x(0)) = zı(1) (a(T)) = z1) (7-55) 


Since any potentials acting on the particle are constant in space (i.e., 
no forces act), the second derivative of the transition element of posi- 
tion is zero in accordance with Eq. (7-42). Thus an integration gives 


5 
GO) = [ne + يم‎ a» |) (7-56) 
Note that the expression in the brackets is just the value of z(t) along 
the classical path Z(t). 
Problem 7-7 Show for any action that is quadratic 
(x(t)) = 2)0)1( (7-57) 
As a somewhat less trivial example, let us try to evaluate the 
transition element (z(t)z(s)} for the same free-particle conditions. 


Since this is a function of two times, we can write it as f(t,s). The 
second derivative with respect to ¢ is 


oH = (aale) (7-58) 
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This transition element can be worked out by substituting F = x(s) 
into Eq. (7-40). For s t, following the arguments leading to 
Eq. (7-42), the result is —(1/m)(V’[x(t)]x(s)); while for s = t, follow- 
ing the arguments leading to Eq. (7-44), we find that the transition 
element of Eq. (7-58) is of order 1/e. In the limit of small > we have 

82 5 h 1 
m ورج‎ = (me(t)2(s)) = = a(t — s) — (V'[x]2(8)) (7-59) 
Since for our free particle the potential is independent of position, 
the second term on the right of Eq. (7-59) vanishes. The resulting 
equation may be solved by dividing the region of interest into two 
parts. Fort < s 


f = a(s)t + b(s) (7-60) 
while for t > s 
f = A(s)t + B(s) (7-61) 


Thus the first derivative of the function f with respect to ¢ jumps 
by the quantity A(s) — a(s) as ¢ goes from just below to just above s, 
and in accordance with Eq. (7-59), A(s) — a(s) = h/mi. 

The boundary conditions state that 


(x(0)x(s)) = x3(x(s)) = 218(s){1) 
{x(T)x(s)) = x2%(s){1) 


This is not enough information to determine all of the four functions 
a, A, b, and B, but we can either make use of the relation 


4 = (4) ae - 9 (7-63) 


obtained by differentiating f with respect to s, or else notice that 
f(t,s) must be symmetric in ¢ and s. One can conclude that the 
functions a, A, b, and B must all be linear in s. The boundary con- 
ditions are now sufficient to determine the solution. The result is 


(7-62) 


(x(t)a(s)) = Fore + — s(T — J (1) fors < 
(7-64) 
= Fore + La (T — 0 | (1) fort < 8 l 
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That this result is right can be seen by inspection. The product 
of two classical paths taken at different times, #(t)ē(s), is the solution 
of the homogeneous equations obtained by setting the right-hand 
sides of Eqs. (7-59) and (7-63) equal to zero, which satisfies the neces- 
sary boundary conditions. The last terms on the right of the pair of 
equations (7-64) are the special solutions of the inhomogeneous equa- 
tions (7-59) and (7-63), which are zero at the end points. 

The transition element of the product of two positions taken at 
two different times contains more than just the product of the two 
corresponding positions along the classical path. There is a small 
additional term which is purely quantum-mechanical in nature. This 
additional term is consistent with our picture of quantum-mechanical 
motion. Even though the particle moving between fixed end points 
will be found on the average along the classical path, it has an ampli- 
tude for motion along all alternative paths. This fact must be 
remembered when considering the transition element of the product 
of positions at two different times. All the possible positions among 
all the various alternatives must be accounted for in the transition 
element, and this accounting introduces the extra term. Only at the 
specified end points are no other alternatives possible. 

We can better understand the significance of this result if we make 
use once again of the terminology from our classical analogue. Sup- 
pose the path of the particle goes through a particularly large value of x 
at some time s. Then the “average” value of x at a later time t 
is not just the ordinary average z(t). There is a correlation with 
the previous large deflection. Therefore, the “average” product is 
not just the product of “averages.” 

In this and other applications of the classical analogue, we remember 
that the “average” referred to is defined with the help of the weighting 
function .لكأم‎ This weighting function is not positive definite, and 
is in fact complex. Thus we develop such purely quantum-mechan- 
ical results as that of Eqs. (7-64) wherein the extra correlation term is 
pure imaginary! 


Problem 7-8 Find the transition element z(t)x(s) = f(t,s) when 
the potential is not constant but, rather, corresponds to that of a 
forced harmonic oscillator. Do this by obtaining differential equa- 
tions for f and trying the solution 


(f(t,s)) = [2 a(s) + g(t) 1) (7-65) 


Obtain an equation for g(t,s) showing that g is independent of the end- 
point values zı and x, and of the forcing function [derivative of the 
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potential) y(t)]. Show in general that, with T = t: — tı, 


g(t,s) O ROT E ws SIN w(T = t) s<t 


mio sin aT Si wt sin w(T — 8) TELER (7-66) 


GENERAL RESULTS FOR QUADRATIC ACTIONS 


Evidently if the action S is a quadratic form, transition elements 
of many functionals can be determined readily. This suggests that 
we extend our consideration into a somewhat more general class of 
functionals. The technique to be used is the same as that described 
in Sec. 3-5. For example, we note that with a quadratic action S we 
can easily evaluate the transition element of exp (2/R) Jf(tz(t) dt, 
where f(t) is any arbitrary function of time. The transition element 
of such a functional can be written as 


(eII OEH dt) = Jy eG MIS+IFO2( 4 Dx(t) (7-67) 


a 


If the original action S is gaussian, then so is the action 
S' = S + ffx dt 


Thus the path integrals on the right of Eq. (7-67) can be carried out 
by the methods of Sec. 3-5. If S% is the extremum of the action S’, 
then the factor exp (S/,/H) can be extracted as a factor for the path 
integral of Eq. (7-67). The remaining factor is a path integral over 
the paths y(t), which run from zero to zero during the allowed time 
interval. (Weset x = + y, where Z is the classical path correspond- 
ing to the extremum of the action.) 

The integral over the paths y does not depend upon the function 
,)ل‎ since this function appears in the action S’ multiplying only a 
linear term in z(t), and we have seen [cf. Eq. (3-49)] that the remaining 
path integral involves only the quadratic parts of S’ which are not 
more than the quadratic parts of S. This means that the path 
integral on the right-hand side of Eq. (7-67) can be reduced to an 
exponential function multiplied by the transition element (1). The 
result is 


(exp [jf 1020 (إنه‎ = {exp [$ ديم‎ s |} ay (7-88) 


Once the extremum رك‎ has been evaluated, the extremum Sa can be 
obtained from it by setting f(t) identically equal to zero. The action 
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of the forced harmonic oscillator, defined in Eq. (8-68), is a special 
case of this action S,. 


Problem 7-9 Use this result to show that if S corresponds to an 
harmonic oscillator 


S= ® fia- fara 


then 


(exp BEZ a) 21) (exp É جيه‎ | 


Xx Ear * f(t) sin w(t — tı) dt ia a faa TO sin w(t, — t) dt 


mo 


= 0 jh ik f(Df(s) sin w(t — t) sin w(t — tı) ds al) 
where xı, xz are the initial and final coordinates of the oscillator. 
From the transition element given by Eq. (7-68) we can obtain 
the transition element of z(t) itself by another method. Suppose we 
differentiate Eq. (7-68) with respect to f(t). The result is 


6 exp | j | a) = ger [a Se 5 s)|} (1) 


_ 68a 0 if tes 
= FO {exp |; (S sa) |} (1) (7-69) 
Therefore, by evaluating both sides when f(t) = 0, we obtain 
8S1 wl 
O = 1) 58 [poo (7-70) 


We can continue this process to get the second derivative as 


(0= = (Ê yao oP | 8 = 8 | lye O 


OS TOS Sa 

ea k TOYO T FO ا‎ = ae 
Actually, since S% is quadratic only in f [cf. Eq. (3-66)], the transition 
element of a factor of any number of 2:5 can be directly evaluated in 
terms of 6S/,/éf(t) and the quantity 5°S../df(t) 6f(s), which is inde- 
pendent of f. This explains the form of Eqs. (7-64) and (7-65) and 
permits the transition element of a factor of three zs to be written 
down. 
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Problem 7-10 Show, for any quadratic functional, if we write 


(x(t)) = &(t)(1) and (2(#)x(s)) = [(&(Ha(s) + g(t,s)KX1), that 


(x(t)x(s)x(u)) = [(E()%(s)z(u) + 2(Dg(s,u) + Z(s)g(t,u) 
+ &(u)g(t,s)]{1) 


Find the transition element of the product of four zs. [Suggestton: 
Since Si, — Sa is quadratic in f and zero for f = 0, it must have the 
mathematical form S, — Sa = 4S SfOf(s)g(t,s) dt ds + Jaf dt, 
where و‎ and 2 are some functions.] 


TRANSITION ELEMENTS AND THE OPERATOR NOTATION 


In this and the following sections we shall see how transition elements 
look in the conventional notation of wave functions and operators. 
This will help the reader who is familiar with that form of expression 
to relate the results of path integral calculations to other results that 
he already knows. 

If F is a function only of z at a single time, say, the function V (z;) 
at time t,, we know from Eq. (7-10) how to evaluate its transition 
element. Similarly, if F depends on the value of z(t) at two different 
times, Eq. (7-15) tells us what to do. 

Let us consider next the case that F represents the momentum at 
time عا‎ and make use of the approximation that the time axis is cut up 
into slices of length e. Thus 


(tr41 — ZE) (7-72) 
Then we have 

(x 
The right-hand side of Eq. (7-73) can be written as 


Y), = Z Cxlensald)s — (xlarl#)s) (7-73) 


m 
Pe (TE+1 س‎ Xk) 


[f tatt dove, t+ de — | x*@)ev@i de] a- 
Now making use of the wave equation 
(a, t + O = V(r) + e و‎ (7-75) 


x*(2, t+ © = x*(2,t) + as = x*+ " (Hx)* (7-76) 
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` from Prob. 4-3, if H is the hamiltonian belonging to the S. Therefore, 
to first order in e 


Í x* (z, t + e)ey(z, t + e) dz = f x*(2,t)ay(2,t) dz 
- !ع‎ | x*@ozlHy@,)] de — | [H*x*(eDlev(z,) de} (7-77) 


By Eq. (4-30) this last integral can be written as fx*(z,t)[H2y(z,t)] dz, 
or more simply we have 


(xml) = — = / x*(cH — Hx) dz (7-78) 
using the operator notation. This is the same as 

het xh? oy a ah oy 

gm [xa ge Oe م چ‎ 19) 


where we have used the result of Prob. 4-4. The operator )7/2( 8/ عدة‎ 
is called the momentum operator or, more specifically, the operator 
representing momentum in the x direction. We already see why. 
Constructing the transition element of 702 is equivalent to putting 
the operator (4/1) ð/ðx between x* and ¥, just as constructing the 
transition element of x is equivalent to putting z between x* and ¥. 
These relations can be understood, perhaps with greater clarity, if we 
go over to the momentum representation. If 


x(p) = [| x(a) tve dx 
(صال‎ = fT, Woe de 


are the momentum representations of x and ¥, one can show that 


[02 @ XO ae = [° x) dp (7-81) 


(7-80) 


Problem 7-11 Show this. 


Another way to see this relation is the following. Consider the 
transition amplitude given by 


(xili) = SSx*(an,tw) K (an, tw 321,t1) (1,1) 021 dın (7-82) 


Now suppose the whole z: axis is shifted to the right by a small amount 
A. Calling the new axis ريد‎ we have 


tı = 2 — À (7-83) 
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Using this new variable rather than the old zı will not alter the transi- 
tion amplitude of Eq. (7-82). It becomes 


N-1 


60 = E a 0 x(xn,tw) exp 6 X S[Te+ 1, مز‎ 15k, e) 
j= 


Z2 
T 6 S[xa, te; zi - a} MCA — A, t) 2200 dz; dt, (7-84) 


where the path integral for the kernel has been written out explicitly, 
using the methods of Eq. (2- 22). 

Next, we expand S[z2, tz; az, — A, t] and يندالا‎ — A, 0 in Taylor series 
and he only the first-order terms. In particular the exponential 
function becomes 


N-1 


i 2 8 7 
exp | a (i) Sersala | fı = 6 A E Stents; | (7-85) 
We may drop the prime notation in the integral defining the transition 
amplitude, since ينه‎ is a variable of integration. The form of Eq. (7-84) 
now becomes 


(ltl) = [f OKLA) dedre - £4 ff ADKL 
x 0 =. Seer tae ae West} dade (7-86) 


where we retain the notation that the point ود‎ is spaced along the path 
a(t) only by the short time interval e from the point xı and tz = tı + €. 

The first term on the right of Eq. (7-86) is identical to the transition 
amplitude on the left. This means that the remaining term must be 
zero. But this remaining term is a combination of two transition 
elements. Thus 


(x| - g Sli, t+ 65 رن‎ hl vy = (xi h u) (7-87) 
In the convention of Eq. (2-22) we use the classical action along each 
of the short elements of the path. Thus the action S[2,1] appearing 
in Eq. (7-87) is the classical action for the initial path element. 8 
negative derivative with respect to x1 is the classical definition of the 
momentum at zı. So we can write 


(xlpily) = (x1 à 2) (7-88) 


which is the same result as that obtained in Eqs. (7-78) and (7-79). 
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Sometimes working with a complicated S that results perhaps 
from the partial elimination of interacting parts, we would like to 
identify the functional p(t) which corresponds to the momentum at 
time té. The work of the preceding paragraph suggests a general 
definition. The first-order change in the transition amplitude {x|1]¥), 
if all coordinates corresponding to times previous to t are shifted by 
—A, is this A times (x|p(é)!¥). From this principle the momentum 
functional may be found for an arbitrarily complicated S. In a like 
manner, the hamiltonian or energy functional can be defined by shifting 
the time variables, as we shall describe in Sec. 7-7. 


Problem 7-12 Show, if V is any function of position only, that 
V — V(x 

( 7 | (x41) : (vz) y) 

= z Pr x*(HV — VH)y dz (7-89) 


Consider the case that V is also a function of the time. Show that 
the transition element of dV /dt is equivalent to the transition element 
of the operator (i/A)(HV — VH) + dV /ðt. 


Problem 7-13 Show that 


TS 


(xlmaly) = $ f7, x" (Hp - pH dz (7-90) 


and argue for any quantity A, given in terms of an operator or other- 
wise, that dA/dt is equivalent to 04/81 + (i/h)(HA — AH). 


Next we consider an expression F involving two quantities evaluated 
in rapid succession, such as 


PFZ te (7-91) 


This evidently gives 
1 00 2.3, : dud 
IFW) = = f, [| xest + Omak, t + زع‎ y, DIV, dy de 
— f x* (x, t)mz?y (x,t) dx (7-92) 
where t = t}. In developing Eq. (4-12) from Eq. (4-2) we saw 


J, Ket + 64, DI) dy = Fa) + عر‎ Hf) (7-98) 
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so that the first integral in Eq. (7-92) is 
: des x* (z, t + €)max ( a5 £4) vp (r,t) dz (7-94) 


Expressing x* by Eq. (7-76) and using the hermitian property of H, 
we find that this integral is ` 


: Ta x*(z,t) (1 = m) mx 2 ar 11) xy (x,t) dx 

= : ji x*(x,t)mxr?y (x,t) dx + 0 x*(x,t)m(2H — Ha)xy(z,t) dx 
(7-95) 

Thus finally 


Tk y 


xe Dm(eH — Hz)zY(e,t) dz‏ | : و 


dz (7-96)‏ (ارنة) بإندط (1,)* | = 


the last step following from Eq. (7-78). This is an example of the 
general rule: In writing the integral definition of the transition element 
for a set of quantities corresponding to a succession of times, the 
corresponding operators are written in order from right to left, accord- 
ing to the order in time of the original transition element. If there 
is a finite time interval At between them, a K, or alternatively the 
operator e7™S كذ‎ must be inserted. (For an example, see Prob. 
7-16.) ` As the time interval e between two successive quantities 
approaches zero the K approaches a 6 function and the rule results. 


/ Lk+1 — Tk 
AK aa 


Problem 7-14 Show that the transition amplitude of (m/«) يبيرته)‎ — 
rx) f(%e41) is equivalent to that of (f: p). 


Problem 7-15 Show that the rule works for two successive 
momenta, that is, 


(| 


wee am ليد‎ Pe ee ees — 2-1 


(= / E اك‎ dx dy 


=— i i: x*(u,) 25 (z,) de dy (7-97) 
Problem 7-16 Show that 


(x Y) = | XEDE lety (FF) vue) ay ae 


(7-98) 
if 4 = tand & = s, provided t > & What happens if t < t? 


M(Lep1 — Lr) 
€ 


Tı 
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Notice that the square of the momentum p? corresponds to pp, or 
two successive velocities times mass multiplied together (as in Prob. 
7-15). It does not correspond to the simple square of velocity at one 
time, (x|m?(zi+1 — xx)?/e?|¥); for that goes to infinity as 77/26 when 
e— 0, as we have seen in Sec. 7-3, particularly in Eq. (7-49). The 
difference between this expression 727/26 and the left-hand side of 
Eq. (7-97) is in fact p? in the limit. That is, 


/ m? (Leo. — Lx)? 
X yr en 


vy = TÈ catty) 


+ {| m Be mn i 


y) (7-99) 


Le41 — Tk Tk — 2-1 
€ 


Problem 7-17 Prove this, using Eq. (7-40) with 


m 
aE (x41 — Tr) 


THE PERTURBATION SERIES FOR A VECTOR POTENTIAL 


The singular behavior of the transition element of the square of the 
velocity, as shown in Eq. (7-49), has as a consequence the fact that 
many expressions involving velocity must be translated with care. 
For example, the lagrangian of a particle in an electromagnetic field is 


L= 5 lt}? + eV (x,t) — =f - A(r,t) (7-100) 


Let us take V = 0 and ask for the effect of the vector potential A 
considered as a perturbation. That is, with So = (m/2)f|t|? dt, 
c = —(e/c)fi A(r,t) dt, we develop a series for use in a perturbation 
treatment and solve for the resulting transition elements. Thus 


(ets, = (De, + 20), — gaa (Da, + ° ° ° (7-101) 
h 2h? 
The first-order term is —tie/ħc times the expression 


(ft ١ A(r,t) dt) (7-102) 


We wish to translate this to operator notation. In defining o for a 
discontinuous path (a series of steps of time length e) we might at 
first expect to write either 


o= —<) (re — n) ١ Alta!) (7-103) 


k 
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or 


giae D (Erti — Fr) + A(Tr+1,le+1) (7-104) 
k 

Either one, in the limit for a continuous path, gives the integral for ø. 

But if we look at a particular component of A, say, Az, we find that 

A.(te1,te41) differs from A.(t;,t.) by approximately 


aA, 
27 )7-105( 


ع + ra) VA,‏ — 1بية) 


which, when multiplied by r.41 — r, again, might be expected to be of 
second order for each k (leading to a term only of order e when k is 
summed). But our paths are not continuous and the transition ele- 
ment of the mean square of 2,41 — 2 is of first order. In fact (see 
Prob. 7-6), 


ات رك وم 


(tien Lx) Yixt — yr) ~ 0 


h 
(Yrr1 — yx)? = — = 
etc., to first order in e. Hence Eq. (7-103) differs from Eq. (7-104) by 
approximately 
ħe 7 
ae V- A(t) = كد‎ if V-Adt (7-106) 


a zero-order term. So it is imperative to decide which form is correct. 

The general answer to such a question was given in Chap. 2. There 

the rule given was that S is replaced by D Ser(Xe+1,te413%x,tk), where 
k 


Sa is the classical action to go from one point to a neighboring point. 
It is not necessary to calculate this action exactly, but only sufficiently 
closely to resolve ambiguities. Equations (7-103) and (7-104) are 
not sufficiently closely calculated for this purpose, but the classical 
action for a short interval is very close to 


m|tr+1 — Tel? 


Salk + 1, k] = De 


+ 5 [AGresistess) + ACist)] (era = ro) 
(7-107) 


Therefore, the correct expression for ø is the average of Eqs. (7-103) 
and (7-104), so that the transition element of Eq. (7-99) is 
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(0 (Tita — Te) ١ IA (Tryn) + A(t,ts)]) (7-108) 


Leaving the sum on k for later evaluation (as an integral over time) the 
result is expressed as the operator (1/2m)(p-A + A> -p) (see Prob. 
7-12). 

That is, in an electromagnetic potential, the first-order term in the 
perturbation expansion has the same form as the first-order term 
given in Eq. (6-11), but the quantity V is replaced by the operator 
—e/(2cm)(p-A+A-p). 

This conclusion is not true in the second order. The second-order 
term requires our finding 


1E K [J t+ A(r,t) dé]?) -3(¢ y ONI e Berni 
ar AC) {nes iti) 3 [A (ratat) + A(rut)]}) (7-109) 


Nothing special happens for the terms k z l, and we obtain in fact 
precisely the second-order term expected by comparison to Eq. (6-13) 
with V replaced by the operator —e/(2cm)(p-A + A‘ p). But when 
l =k, the coincidence of the two velocities gives a new term. In 
view of Eq. (7-49) and Prob. 7-6 we get an additional quantity 


s (+ K (5 [A(Te+ıte+1) + Act), (7-110) 
7 


m 


which is equivalent to ie?/2mc?f[A(r,t) ٠ A(r,t)] dt and has the same 
effect as the first-order action of a potential (e2/2mc?)A- A. 

Thus the perturbation expansion for the action of a vector potential 
has the same form as Eq. (6-17). The potential V is replaced by the 
operator —e/2mc(p-A + A‘ p) + e?/2mc?A+-A. We have shown it 
to second order in A, but a little consideration shows it is true to any 
order. 

The hamiltonian for a particle in a vector potential A is 


H= (2-24) (p - 2a) (7-111) 


It differs from that of a free particle [which is (1/m)p ° p] by just this 
operator —e/2mc(p-A + A‘ p) + e?/2mc?A-A. This is a much 
easier way to arrive at the result we have just obtained. 
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7-7 THE HAMILTONIAN 


Using what we have so far derived, it would be very easy to write 
down the transition amplitude for the hamiltonian. We take the 
transition amplitude for the square of the momentum, divide it by 2m, 
and add the transition amplitude for the potential. In this way the 
hamiltonian itself at the time t, could be written as 


pee (nas = 2) == 2 2) + V(t) | (7-112) 
2 € € 
while in operator form we have the transition element of the hamil- 
tonian as 
مقس ا‎ i, EA RE ; 2 
mw = f, [Ze + ve var = fo xaya (G-18) 


Although this method for defining the transition amplitude for the 
hamiltonian gives a perfectly correct result, it is somewhat artificial, 
since it does not exhibit the important relationship between the 
hamiltonian and time. Therefore, we shall next consider an alterna- 
tive definition of this transition element based upon an investigation 
of the changes made in a state when it is displaced in time. This 
approach will also enable us to define H, given only the form of S, no 
matter how complicated. 

To carry out this investigation, we break up the time axis into 
infinitesimal intervals, just as we did in defining path integrals. 
Now, however, it is important to point out that the subdivision of 
time into equal intervals is not necessary. Clearly, any subdivision 
of time into equal intervals is not necessary. Any subdivision into 
instants t; will be satisfactory; the process of taking limits is character- 
ized by having the largest spacing ti+ı — t; approach zero. 

For simplicity, our system will consist of a single particle moving 
in one dimension. The action is represented by the sum 


S = 3 8] جوت‎ titti ti] (7-114) 
where 
Slrcryteritat] = ff" MEO 2O dt (7-115) 


The integral in this expression is taken along the classical path between 
xi at t; and 2;4; at .رما‎ For our one-dimensional example we can write, 
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with sufficient accuracy, 


Sirti] = [z (za) - 7 (z+ | (f1 — t) (7-116) 
itl T7 يا‎ 

The normalization constant associated with an integral over dz; at the 

time t; is the same one we have used before, namely, 

= 2rhi(ti+1 = t;) 4 

i [ta] (7-117) 

The relation of H to the change in a state with displacement in 
time can now be studied. Consider a state y(t) defined by a space-time 
region R. Now imagine that at the same time ¢ we consider another 
state s(t), defined by another region Rs. Suppose the region Rs is 
exactly the same as R except that it is earlier by a time ô, that is, 
displaced bodily toward the past by a time 6. All the apparatus 
required to prepare the system for Rə is identical to that for R but 
is operated a time ô sooner. If L depends explicitly on time, it too 
is to be displaced; i.e., the state Ys is obtained from the L used for the 
state y except that in writing Ls we use as the time variable t + ô. 

Now we ask: How does the state ول‎ differ from y? In any measure- 
ment the chance of finding the system in some fixed region R’ is 
different depending on whether the region of origin was R or Rs. 
Consider the change in transition amplitude (x|1|¥s) produced by the 
shift in time 6. We can consider this shift as effected by decreasing 
all values of t; by 6 for i < k and leaving all t; fixed for 7 > k. 

If the reader looks ahead at this point, it may occur to him that 
we are headed for trouble. Clearly, it is our intention eventually 
to take a limit as all infinitesimal time intervals are decreased to zero. 
However, with the present setup, at least one time interval ti41 — tx 
has a lower bound, so that it cannot be indefinitely decreased. This 
difficulty could be straightened out by assuming the time shift 5 to be 
itself a function of time. We can imagine that it is turned on smoothly 
before t = t, and turned off smoothly after t = t Then keeping the 
time variation of ô fixed, we can let all time intervals proceed smoothly 
to zero, including يبي‎ — tr. We would then investigate the first-order 
effect of the time shift by letting the magnitude of 6 approach zero. 
The result obtained by this more rigorous process is essentially the 
same as that of the procedure we are using in our present example. 

Returning now to our investigation of the effect of the time shift 
we see that the action S[2i+1,ti+1;2:,ti] as defined by Eq. (7-115) will not 
be changed so long as both t;+ı and t; are changed by the same amount. 
On the other hand, Slaz+1,te+1;7z,te] is changed to S[are41,te+13te,f — 4). 
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Furthermore, the constant A associated with the integration over dz, 
is also altered and becomes 


ee | —t + I" (7-118) 


m 


We use Eq. (7-2) to define the transition amplitude. Keeping 
in mind that the path integral depends on both the action S and the 
normalizing constant A (both of which are altered by our time shift) 
we can write the change in the transition amplitude to first order in ô as 


odil) — وواد‎ = (p | ب انت‎ Bw) 


(7-119) 


the second term coming from the change in A. We wish to define the 
functional corresponding to the hamiltonian in quantum mechanics as 


_ لسأونةن بمارت سمتد]اقة‎ h 3 
Him atk + Hla — te) oe 


The first term on the right-hand side of this last equation is the 
definition of the classical hamiltonian. The second term is necessary 
in the quantum-mechanical definition in order to keep H, finite as 
the time interval يبي‎ — & goes to zero. This last term is a conse- 
quence of the change in the normalizing constant A due to the time 
shift 6. 

Applying this result to the specific one-dimensional example indi- 
cated by Eq. (7-116), we can write the operator H, as 


H = m (zea ey ae Pa سام‎ EV Ga) 


titi — tk 21(te+1 — tk) 
SL LER AY : 
2 0 = a) (2 = et) + Vier) (7-121) 


The second of these equations is based upon the results obtained in 
Eq. (7-54). By writing the product of velocities as the product of 
two successive velocities, we can do away with the apparently extrane- 
ous term %/[27(t.41 — &)]. 
Using the relation t = t — ô for all values of t < tx, we have 
oy oy 


V(t) = Plt) + ة‎ — = + b=, 


connecting the function y defined in the two regions R and Rs. Thus 
the cycle of relations connecting operators to the Schrédinger equation 


194 


Transition elements 


195 


and to path integrals can be closed with the result obtained by combin- 
ing Eqs. (7-119), (7-120), and (7-122). 


GUN 0 
5 (xia at/ 7 p XxlHelY) (7-123) 
which leads us back again to the Schrödinger equation 
i er 
sar = HY (7-124) 


For arbitrarily complicated actions we can find an expression for 
the hamiltonian (i.e., a functional corresponding to the energy) by ask- 
ing for the first-order change in the transition element (x|1|¥) when 
all times previous to ¢ are shifted by —6 and writing this change as 


5(x|H (1) |). 
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Harmonic 
Oscillators 


Tue problem of the harmonic oscillator is perhaps the simplest in 
quantum mechanics. We have solved it completely in noting that? 
the kernel for the motion of a harmonic oscillator is 


mw 4 
K(zo,T ;£a,0) = r7) 
LL 2 = 7 
exp 8 0 [(za + 2,7) cos wT — 22,7] (8-1) 


If we are to make full use of this, we should look at all sorts of problems 
which involve harmonic oscillators, either exactly or approximately. 
It is the purpose of this chapter to describe several such problems, both 
those involving single oscillators and those involving systems of inter- 
acting harmonic oscillators. We could carry this program to extremes 
and include all kinds of classical vibration problems, plates, rods, etc., 
but such systems are so large that it would be a waste of time to 
analyze the quantum-mechanical corrections. Instead, it would be 
better to look at systems on the atomic scale. For example, we might 
analyze the oscillations of the molecule CO. In so doing, we find 
that the potential energy between the carbon and oxygen atoms is not 
exactly quadratic. Nevertheless, for the lower-energy state the 
potential is so close to quadratic that a pure harmonic oscillator treat- 
ment is a good approximation for many purposes. 

In a much more complicated polyatomic molecule, when the excita- 
tion energy is not too high, the travel of the atoms is small compared 
with their spacing. In this case again the potential energy is very 
nearly a quadratic function of the coordinates. Thus the system is 
approximately equivalent to a set of coupled harmonic oscillators. A 
solid crystal is, from one point of view, a polyatomic molecule of great 
size. As such it is a vast array of interacting harmonic oscillators. 

As another example we can consider the electromagnetic field in a 
cavity. Classically, there are several patterns of standing waves, or 
modes, in which the field can vibrate harmonically with a definite 
frequency. In quantum mechanics, each of these modes constitutes 
a quantum oscillator. 


THE SIMPLE HARMONIC OSCILLATOR 


Solution from the Schrédinger Equation. In this section we 
shall develop a number of relations describing the simple one-dimen- 
sional harmonic oscillator. We shall begin with the language of the 
Schrédinger equation. Problem 2-2 gave the lagrangian describing a 


1 Cf. Prob. 3-8. 
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one-dimensional harmonic oscillator as 
2 (42 — wx?) (8-2) 
The corresponding hamiltonian, which we use in the present treat- 
ment, is 
ene | عب لبه‎ 
H om 38 get (8-3) 
The wave equation is then 
hoy _ 
2 
= 
3 (Z + zo) 1 (8-4) 


Since the hamiltonian is independent of time, the wave equation is 
easily separated, and it yields wave functions of steady states of definite 
energy En. The time-dependent part is proportional to exp (¢H,t/h). 

Recalling that the momentum operator p corresponds to differentia- 
tion with respect to x (cf. Sec. 7-5), we can write the Schrédinger equa- 
tion for the spatial part of the wave function as 


_ 82 324, | mex? 


Hon = Əm Ger + pos gon = Enon (8-5) 


This equation is easily solved. The result is given in many books 
on quantum mechanics.! The eigenvalues for the energy are 


E, = ho(n + J$) (8-6) 


where n is an integer, 0,1, 2, .... The eigenfunctions $, are 


dn = (2°n!) (Sy Ha 5 2 سق‎ (muz?/ 2h) (8-7) 


where the functions H, are the Hermite polynomials 
Ho(y) = 1 


H,(y) = (1er e (8-8) 


1L. I. Schiff, “Quantum Mechanics,” 2d ed., McGraw-Hill Book Company, 
New York, 1955. 
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The Hermite polynomials are best defined by their generating 
function 


entity = 0 H,(y) a (8-9) 


We can obtain these results in another manner. The functions 
$n have been obtained by solving a differential equation, namely, the 
time-independent case. However, we already have a solution for the 
time-dependent case. From this solution we should be able to derive 
these functions directly. It is instructive to carry out this derivation 
to illustrate some of the formulas which have been derived in earlier 
chapters. 


Solution from the Kernel. We have worked out the kernel 
describing the motion of an oscillator in Prob. 3-8. On the other hand, 
we know from Eq. (4-59) that this kernel can be expanded in expo- 
nential functions of time multiplied by products of energy eigen- 
functions. That is, 


m % im 
(sate on) exp 86 mee) [ (x1? + £2?) cos wT — 22 Xo] 


)8-10( ماو نودعس J‏ = 


n=0 
Using the relations 
i sin wT = YerT(1 — eT) 
cos wT = Meut eT) 
we can write the left-hand side of Eq. (8-10) as 


mw \% 
ies — (iw 7/2) — e—2iwT\—% 
(¥) € (1 — e-%7) 
mu 1 + eT Ara iT 
X exp {= Oh | + Xo") (E) — عد‎ || )8-12( 


We can obtain a series having the form of the right-hand side of Eq. 
(8-10) if we expand Eq. (8-12) in successive powers of e~*7. Because 
of the initial factor e-7/?, it is clear that all terms in the expansion will 
be of the form e—7/2e-imT for n = 0, 1, 2, . . . . This means the 
energy levels are given by 


E, = h(n + 1) (8-13) 


(8-11) 
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To find the wave functions, we shall have to carry out the expansion 
completely. We shall illustrate the method by going only as far as 
n = 2. Expanding the left-hand side of Eq. (8-10) to this order we 
. have 


Ys 
م‎ me (1 + Ye? + ° + <) exp |- Se (x32 + 292) 


2mw 


7 Lit iT عل‎ . - 1 (8-14) 


a > (x? +- T2) (e 7 ل‎ . e) = 


or 


ma \* -ena (atta) eiut? 17م يم‎ 
= e7 (me 21 z3) ع‎ iw (1 + Lge- iw ) 


2mw seo Amu? 
x |: + 7 206 T + 212 


x£ 12222720T 


= = (21? + g?) tor z 1 )8-15( 


From this we can pick out the coefficient of the lowest term. It is 


(=) er (mal2) (zı? +z) g—(i0 T/2) = g— CIM ET po (2) “ب‎ (x1) (8-16) 
This means that Eo = fiw/2 and 

J4 
oo(z) = (=) قم‎ (maz? / 2h) (8-17) 


We have chosen وف‎ to be real. We could make it complex by including 
the factor ,ع‎ where 6 is a constant; however, it would make no dif- 
ference to any physical result. 

The next-order term in the expansion is 


e—toT /29—iwT T سم‎ (mu [2) (zı +z) ame Lizo = CE CAET p(x 2) pF (21) (8-18) 
T 

which implies that Eı = 3ghw, and 

(£) = 27 voola) (8-19) 


The next term corresponds to E: = 54wh. The part of the term 
depending on zı and zz is 


(=) g(a 2) (zı +z) KS 21229? — (z? + 2 | )8-20( 


71 h? 
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This must be the same as ¢2(x2)¢3(21). Since the expression in the 
brackets can be rewritten as 


3 Ce oie = 1) Cs Pe 1) (8-21) 

we find Š 

pile) =e TO (8-22) 
2 2) 7 0 


These results can be compared with those obtained from the solution 
of the wave equation, given by Eqs. (8-7) and (8-8). 

All of the wave functions may be obtained in this manner. How- 
ever, it is a difficult algebraic problem to get the general form for $n 
directly from this expansion. A less direct way is illustrated in the 
problem. 


Problem 8-1 Note that the amplitude to go from any state f(z) 
to another state g(x) is the transition amplitude (g|1|f) as defined in 
Eq. (7-1). 

Suppose f(x) and g(x) are expanded in terms of the orthogonal func- 
tions ¢,(x), the solutions to the wave equation associated with the 
kernel K (2,1), as discussed in Sec. 4-2. Thus 


f(x) = Zfndn() gl) = Zgndn(x) (8-23) 


Using the coefficients f, and g, and Eq. (4-59), show that the transition 
amplitude can be written as 


UM Ba? (8-24)‏ عل f(a1) day dzo = Egå‏ )51,0 "1 رونه) كل (ونه) * 9ل 


Next, suppose we choose a special pair of functions f and g for which 
the expansion on the right-hand side of Eq. (8-24) is simple. Then 
after obtaining the functions f, we could get some information about 
the wave function ¢, from the expansions of Eq. (8-23). Suppose we 
choose the functions f and g in the following way, 


4 
f(x) = =) e- (mul 2h) (2—a)? (8-25) 


4 
g(x) = هم‎ 2 (2b)? (8-26) 


These functions represent gaussian distributions centered about a and 
b, respectively. We shall call fa = fn(a) and gn = fn(b). Determine 
the transition amplitude (f|1|g), where f and g are given by Eqs. (8-25) 
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and (8-26), respectively, and the kernel is that for a harmonic oscilla- 
tor, Eq. (8-1). Perform the integrals in Eq. (8-24) to get 


exp |- ae = To (a? ++ 02 = 2ebe) | 
= X f(a fO) CDE? (8-27) 


From this result show that En = hw(n + 16) and that 


w n/2 n ر‎ 2 
O (x) a exp 6 ate) (8-28) 


Use this result in Eq. (8-24) and write for ¢, the form given by 
Eq. (8-7) considering the H,(z) still unknown. From this derive the 
generating function of Eq. (8-9) for these functions H,(z). 


THE POLYATOMIC MOLECULE 


In the preceding section we derived the wave functions and energy 
levels which describe the simple harmonic oscillator. In this section 
we begin our investigation of systems of interacting oscillators with 
the study of polyatomic molecules. We begin the analysis by assign- 
ing coordinates describing the position of each atom in the molecule. 
The position of any particular atom a will be given by the three car- 
tesian coordinates Za, Ya, and Za, whose origin lies at the equilibrium 
position for the atom. If the mass of the atom is ma, the kinetic energy 
of the whole molecule is given by 


` 5 Malte? E (8-29) 


where the summation is carried out over all atoms in the molecule. 

It will be more convenient for this general discussion to avoid the 
vector aspects of this description by making the following modification. 
We suppose there are N atoms in the molecule. We shall define 
n = 3N cartesian coordinates in the following way, 


V Mata qs = Vma Ya Gs = V Make‏ = ين 


H =. (8-30) 
qa = كيه‎ ms Le q5 = V 1115 Yo 
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In terms of these new coordinates the kinetic energy has the form 


2 l aa -31 
j=1 
The potential energy is the function V(qı,q2, . . .( of all the dis- 


placements q; We can expand V in a Taylor series around the equi- 
librium position q; = 0. Thus 


V (91,92, HTE E) = Y (0,0, aa 2 ,0( + 1 q;V;(0,0, TES 0) 
j=1 
lv Ñ 
+5 2 > qqxVn(0,0,... ,0( + ° + ° (8-82) 
j=1 k=1 
where 
OV 32V 
TEE Va = ee 
V; ðq; fag an 


The first term is the potential energy at equilibrium. It is a constant 
independent of g. We shall assign the value zero to this constant 
potential and measure all other potential energies in a manner con- 
sistent with that assignment. Thus the first term can be omitted from 
the expansion. The factor V;(0,0, . . . ,0) appears in the next term. 
This is the potential gradient or force associated with the coordinate 
رو‎ and evaluated in equilibrium position. This factor is therefore 
zero, and the term can be omitted from the series. To put this 
another way, since equilibrium corresponds to a minimum of potential 
energy, there must be a vanishing first-order change for displacements 
around the equilibrium position. 

The factors V;,(0,0, . . .) appearing in the next term comprise a set 
of constants whose value depends on the structure of the molecule. 
Call these constants v;,. Now suppose we neglect all higher-order 
terms. Thus we make the approximation that the potential energy 
involves the coordinates only quadratically. Even if the potential is 
not a pure quadratic function of the eoordinates, our approximation 
will be valid for small displacements. It is by this approximation that 
we represent our molecule as a system of harmonic oscillators. 

Combining Eqs. (8-31) and (8-32), we can write the lagrangian as 


1 n : 1 n n 
L= z ep iie 5 0 3 Gide in (8-34) 
j=l E EY 


Next, we introduce this lagrangian into the path integral which defines 
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the kernel describing the motion of the atoms in the molecule, 


x= ff f (ea [Lf oe 


- $ va f aut at]! on 21.0 «De (8385) 


j=l k=1 


All of these path integrals are gaussian, and thus they can be solved 
by the methods discussed in Sec. 3-5. To carry out that solution, we 
shall have to find those paths q;(¢) which give a stationary value for the 
action integral. Variation with respect to each q; gives these paths as 
solutions of 


0 0 Vik Qe (8-36) 
k=1 


This last equation says that the force on any single atom in a particular 
direction is some linear combination of the displacements of all the 
atoms. 

Such systems of interacting oscillators have been analyzed to a great 
extent from a classical point of view. Since in many problems of 
quantum mechanics we obtain the classical action as the first step in 
solving the kernels, all of this classical work will be of great value to us. 
One important result of the classical analysis is the following. There 
are special ways to distort the molecule so that, as time goes on, the 
motion is of the simple periodic sinusoidal type. The pattern of dis- 
tortions remains the same, and only the amount of the distortion 
varies sinusoidally with time. Different patterns of distortion, or, as 
we say, different modes, correspond in general to different frequencies. 
There may be some with zero frequency, and some groups of modes 
may all have the same frequency. The important fact is this: Any 
small displacement motion of the molecule can be built up as a linear 
combination of such modes. 

If there are N atoms in the molecule, then the molecule has n = 3N 
modes of motion. Thus, for example, the molecule CO, has nine 
modes, as shown by Fig. 8-1, where the motion of each atom is indi- 
cated by an arrow. Only modes 1 to 4 are periodic (i.e., have a non- 
zero frequency) and the direction of motion during the first half-cycle 
is indicated. For the second half-cycle, reverse all arrows. 

We shall next derive the mathematical description of the modes. 
This derivation is, of course, part of classical physics, rather than 
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Fig. 8-1 Normal modes of the CO» molecule. The symbol © means motion out of 
the plane of the paper, and Q) means motion into the plane. Modes 1 to 4 are periodic; 
modes 5 to 7 are continuous translations; and modes 8 and 9 are continuous rotations. 


quantum mechanics. Consider a particular mode of frequency w. 
All of the coordinates رو‎ move together and at the same frequency. 
There must be some special set of initial displacements ,ره‎ different for 
each mode, such that if all initial velocities are zero, the subsequent 
motion of any coordinate can be written as 


qj = a; COS wt (8-37) 
Substituting this equation into Eq. (8-36) gives 


wa; = 3 VjKAk (8-38) 
k=1 
The last equation is actually a set of n equations for the n unknowns 


Since it is homogeneous, it has a solution only if the determinant‏ .ره 
of coefficients vanishes. Thus we require‏ 


(w? aa 011) - 5و0‎ 0 — Vin 
= zin وي‎ EF 

E RI PAD Ps - 0 )8-39( 
— 01 a ne mite TA (w? = Van) 


This equation has n solutions for .2س‎ For a particular solution, say wa, 
we can get solutions for the set of equations (8-38). We shall call 
these aja. The sizes of the constants aje are determined relative to 
each other, but the absolute magnitude of the whole set is arbitrary. 
We shall choose this magnitude so that 


3, Gia? = 1 (8-40) 
=1 
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We can repeat this process for all of the n modes, a = 1,2,...,n. 
We determine n values of wa, and for each value of a we obtain n 
solutions for the J constants .يزه‎ Any possible motion of the system is a 
linear combination of these modes. We can write an expression for a 
general type of motion as 


Y, CaGja COS (wat + ôa) (8-41)‏ = رو 
a=1‏ 


Here the constant of amplitude C, and the constant phase 6, depend on 
the initial conditions. That such an expression does represent the 
motion of the system is easily verified by substituting Eq. (8-41) into 
Eq. (8-36). 

It is convenient to use the complex notation for Eq. (8-41). That is, 


n n 
qj = 3 )0 عقاو اماو يروي‎ = > O (8-42) 
a=1 a=1 


where only the real part of the expression has physical meaning. The 
complex constants ca depend on the initial conditions, and they can be 
determined as follows. If the initial positions and velocities are q;(0) 
and ġ;(0), respectively, we have 


(RE Ga) Oi 
f ge (8-43) 
G0) = R.P. Y tcadjatoa = J; [— (Imaginary Part ca)watja] 
a=1 a=] 


R 
ih4s 


q;(0) = Real Part > Calja = 
a=1 


Since the constants aja are all real, this pair of equations determines 
both the real and imaginary parts of Ca. 

We can solve Eqs. (8-43) in a simple way, making use of the impor- 
tant property expressed by Eq. (8-48), which we now prove. For any 
particular a the constants aja satisfy 


Oa Aja — 0 1 (8-44) 
k=1 
If we multiply this equation by ajs and sum over all values of j, we find 
Wa? X وزممزه‎ = Y, J, كن‎ (8-45) 
j=1 k=1j=1 


Since the coefficients v are symmetrical, the left-hand side of Eq. 
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(8-45) will be the same if a and 8 are interchanged. This means 


(wa? = wp?) > 0 وزع‎ = 0 (8-46) 


2-1 


Thus if the frequencies wa and wg are different, it must be that 
3 ajap = 0 (8-47) 
j=1 


If the two frequencies are the same, then the constants aj, are not 
determinant. Instead, we have the freedom to make an arbitrary 
choice which can be made in such a way that Eq. (8-47) is satisfied for 
a عد‎ 6. Thus finally, making use of the normalization defined in Eq. 
(8-40), we can write 


3 0 ورلام‎ = Sap (8-48) 


j=1 
where dag is the Kronecker delta. 

We can now easily find the real part of ce from Eqs. (8-43). Multiply 
the first of Eqs. (8-43) by ajs and sum over values of a. All terms on 
the right-hand side vanish except that for a = 8, which gives 


RER- Ca = 0 aj8q;(0) (8-49) 


g=1 


In a similar manner we can find 
LEC 
I.P. eg = ب‎ h a;gġ;(0) (8-50) 


Thus a complete description of any arbitrary motion of the system can 
be determined from a knowledge of the normal modes of the system 
and the initial conditions of the motion. 


NORMAL COORDINATES 
We can analyze the motion of the system in another way.:- Let us 


choose a new set of coordinates Qae(t), which are a particular linear 
combination of the old coordinates, namely, 


Qa) = Y agilt) (8-51) 
j=l 
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Alternatively, the old coordinates can be given in terms of the new by 


n 


a(t) = Y ajeQa(t) (8-52) 
a=1 

Making use of Eq. (8-48), we can write the kinetic energy of the 
system as 

KE = 5 D å? BIS P dja t;g as =e e Qa? (8-53) 


= 2 5 


The potential energy is 


5 D y ديه‎ 7 0 ١ ` Un Aja xg Qas (8-54) 


=1k=1a=1p=1 

From Eq. (8-38) we have 
> Vj akg = wg ljg (8-55) 
k=1 
which means that the potential energy can be written as [using Eq. 
(8-48) ] 

1 n 1 n 
V = 2 wp QgQa = > djaljg = 9 > OPAM (8-56) 


j=1 a=1 


Thus the lagrangian of Eq. (8-34) can be written in terms of the new 
variables as 


L=3 9 Qe - مه‎ (8-57) 


The lagrangian in this form represents a set of harmonic oscillators 
which no longer interact. That is, the variables are separated. Each 
oscillator has unit mass and its own particular frequency wa. The 
equation of motion for a particular oscillator is 


Qa = ald Os (8-58) 


This means that each mode oscillates freely at its own frequency inde- 
pendently of any other mode. By comparing Eqs. (8-49) and (8-50) 
` with Eq. (8-51) we see that the real part of cg and the imaginary part 
of —cgwg are just the initial coordinate Q,(0) and the initial velocity 
Q,(0), respectively, of the 8 mode. Thus the complicated molecule is 
equivalent to a simple set of independent harmonic oscillators. 


Quantum mechanics and path integrals 


This new set of coordinates Q, which permit us to describe the sys- 
tem as a set of independent oscillators is called a set of normal coor- 
dinates. Using the lagrangian given by Eq. (8-57), we can write the 
path integral describing the motion of the system in terms of normal 
coordinates as 


K = | {exp E ) | û — w0. a |} D91 DQ: ٠ ٠ ١ DQ, (8-59) 


This last result can be obtained directly from Eq. (8-35) by the 
explicit substitution g(t) = 37 jaQa(t). The exponent simplifies just 


as in the classical case, while Dqi ٠ ° ٠ Dan = 201, . . . , DQn at 
least within a constant factor. [Since the transformation of coordi- 
nates is linear, the jacobian is constant. Any such constant can be 
contained in the definition of the normalization factors for the path 
integral DQi(t) DQ) ٠ ٠ -.] 

This form of the path integral can be broken down into a product of 
path integrals. Thus 


acoil {exp | 5m J (û - ata ae |} o. (8-60) 
a= 


where each path integral now describes only one mode and each mode is 
a simple one-dimensional oscillator, for which we have already obtained 
a solution. In this manner any problem of interacting harmonic 
oscillators can be analyzed. 

Since the path integral for the kernel can be separated into a product 
of path integrals, it follows that the wave function for the system in a 
given energy state can be written as the product of wave functions of 
each mode as discussed in Sec. 3-8. 

As shown in Sec. 8-1, the wave functions for each separate mode are 
proportional to exp (tH,t/h), where E, is the energy of the mode. A 


product of such wave functions is then proportional to exp [(it/h) D E,). 


From this it follows that the total energy of the system of oscillators is 
equal to the sum of all the separate energies. The energy in the 
a mode is wa(Ma + 34), where me is an integer. The energy of the 
whole system is then 


E = ħwi(m, + 14) + hwe(me + 4) + ° ° . + on(m, + 14) (8-61) 


where mı, mz, . . . are all integers (including zero). All independent 
choices are allowable because the excitation of oscillator 1 and oscillator 
2 can be in different degrees. 
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If ¢n(Q) is the harmonic oscillator wave function for the nth level 
[as given in Eq. (8-7)], then the wave function for the complete system 
is 


dm,(Qz) bma(Q2) + > dma(Qn) = T] bna(Qa) (8-62) 
a=1 

Each ¢ma(Qa) is as given in Eq. (8-7) with w replaced by wa. In 
this way classical physics, whereby we determine the normal modes, 
and quantum mechanics, whereby we determine the wave function 
and energy levels of a simple harmonic oscillator, are combined to 
give a complete solution for the energy levels and eigenfunctions of a 
polyatomic molecule. 

We can express the wave functions in terms of the original coordi- 
nates q:(t) by using the transformation equation (8-51). For example, 


the lowest energy state of a system, which has the energy L 0 We, has 


a=1 


the wave function 


= il exp (- z=) = exp (- >), a=) 
=o (-3 39 3 St) مم‎ 


a=1j=1 k=1 


That is, the wave function is an exponential function of the quadratic 


1 n n ' f 
form — 5 7 1 M ikqiqk, where the matrix element M is 


j=l 5-1 
Mn = 0 Sats (8-64) 
a=] = 


Problem 8-2 Show that the M,, matrix is the reciprocal square 
root of the v matrix. That is, show 


0 y MiMiWim = Sim (8-65) 
k=11=1 

It may happen that some of the frequencies w are zero. For exam- 
ple, for the molecule و00‎ the modes 5 to 9, as pictured in Fig. 8-1, all 
have zero frequency. They correspond to a translation or a rotation 
of the whole molecule, motions for which there is no restoring force. 
Since there is no restoring force, the assumption that the coordinates 


Qa are small is not generally true. A more exact analysis of the 
translation or rotation kinetic energy must be undertaken. Since 
such motions are not of interest in the present discussion, we shall 
assume that these modes, and their coordinates, either do not exist 
or are never excited, so that we have been dealing with modes for 
which w z 0. If for particular values of a the solutions ثيه‎ come out 
negative (so that w is imaginary), the system is in unstable equilibrium 
for motions in this mode, like a pencil balanced on its point. Instead 
of being simple harmonic, the motion is exponentially divergent, and 
again the coordinates Qa do not stay small. This case again is of no 
interest in the present discussion, and we shall assume that there are 
no such modes. 


THE ONE - DIMENSIONAL CRYSTAL 


A Simple Model. We can think of a crystal as a large polyatomic 
molecule spread out in a three-dimensional array. We can begin 
learning about it by first studying a simpler one-dimensional line of 
equal atoms equally spaced, as in Fig. 8-2. Let the mass of each atom 
be 1 and let the displacement of the jth atom from its equilibrium 
position be .رو‎ We suppose that the motions are restricted to lie along 
the line of the array, i.e., longitudinal motions only. Next, suppose 
each atom interacts only with its neighbor and with a potential V(R) 
which depends upon the separation R between them. That is, we 
suppose the atoms are connected together by a set of springs. The 
equilibrium separation gives a minimum value to the potential. We 
shall assign this minimum the value 0. Suppose AR is the difference 
between the equilibrium displacement and some particular displace- 
ment. We can expand the potential in a power series in terms of AR, 
in a manner analogous to that of Eq. (8-32). We shall restrict our 
attention to those displacements which are so small that all terms 
higher than the second order in this expansion can be neglected. 
Between the jth and (j + 1)st atoms the change in separation away 
from the equilibrium separations is q;}ı — رو‎ = AR;j4:. We shall 
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Fig. 8-2 A model of a one-dimensional “crystal,” with mass particles evenly spaced 
along a line and springs connecting neighboring particles. 
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call the second derivative of the potential with respect to the dis- 
placement »* (the same for all atoms in the string). Then the poten- 
tial energy associated with such a displacement is 


Vise = rga — g)? (8-66) 


and the lagrangian can be written as 


A N-1 , 
4 1 
L= 2 5 0 > 3 (G41 — 93)? (8-67) 
j=l Fon 


If the first and last atoms are unattached, then the term for J = N in 
the expression for potential energy must be omitted. 

Based on this lagrangian the equations of motion for the atoms along 
the line are 


Gi = 101 — a) — (G5 g) (8-68) 


for all ïj except the end points j = 1 and n = N. Now this fact that 
the end particles have to be given separate consideration is just a 
minor annoyance for most problems. Usually we are interested in the 
gross properties for a large solid and are not concerned with surface or 
boundary effects. In such cases the main results desired are really 
independent of the actual boundary conditions (e.g., whether or not 
the end atoms are left free or are tied down, etc.). To avoid this 
problem, theoretical physicists use a trick of assuming a special set of 
simple boundary conditions, called periodic boundary conditions, so 
that these end points do not require special consideration in the 
analysis. Unfortunately, these special boundary conditions occur in 
actuality only rarely, if at all, but for phenomena which are inde- 
pendent of boundary effects the trick is useful. 

The idea is to imagine that the string of atoms goes on beyond N, 
but a displacement of the (N + j)th atom is always exactly equal to 
that of the jth atom. Thus the boundary condition is 


qN+1 = Qı dN+1ı = dı )8-69( 


This boundary condition would be right if our string were tied in a 
circle like a pearl necklace. However, in three dimensions there is 
no such picture to represent the boundary condition, and it must be 
considered completely artificial. 

The value of this particular boundary condition is this. Most 
general ways of terminating the string (e.g., tying the last atom to a 
rigid wall, leaving the last atom free, etc.) result in a reflection of any 


wave traveling down the string. Only if the last atom is tied to 
another string of atoms of identical characteristics will no such reflec- 
tion occur. Thus the boundary condition is analogous to tying a 
transmission line to a characteristic impedance in order to avoid reflec- 


' tions. The characteristic impedance is equivalent to an infinity of 


more line. In the present case we accomplish this end by tying the 
string to itself. We call the boundary conditions periodic, because 
anything that happens at the point k in the string is repeated again at 
the point N + k and again at 2N +k, ete. With this boundary 
condition Eq. (8-68) for the motion of the atoms is valid for all of 
the atoms. 


Solving the Classical Equations of Motion. We assume that 
the displacements g are periodic with frequency w. Then we must 
solve 


v? (G41 — 2g; + g) (8-70)‏ = رودن 


We could write down this set of equations in a determinant, and it 
turns out that the determinantal equation so obtained can be evaluated 
by theorems in mathematics. But this just means that the equations 
can be solved directly, and it is easier to solve them that way. 

We shall restrict the symbol 7 to mean يد‎ —1, and not use it for an 
index. Solutions are of the form 


qj = Aei) = امسو رن‎ (8-71) 


where 8 is a constant taking on a discrete set of values. This solution 
can be verified by substituting into Eq. (8-70). The frequency is given 
by 


wt = )در‎ — 2 + e=) = 4r? sin? É (8-72) 


This gives the values of w in terms of 8, but not all values of 8 are 
allowed. The periodic boundary condition implies that 8 = 2ra/N, 
where ه‎ is an integer = 0, 1, 2, ..., N — 1. (The case a = 0 is 
just a translation, and we can omit it if desired. Furthermore, the 
case given by a = N + a’ is the same as the case given by a’.) Thus 
for any particular choice of a we have the frequency 


> FQ 
Ga = 2ysin WV (8-73) 


and the amplitude for the jth coordinate at that frequency is 
lja = A ettail N (8-74) 
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Equilibrium position 7 
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Fig. 8-3 The displacement of atoms along a string is plotted as the ordinate against 
the equilibrium positions j equally spaced along the abscissa. In the upper case the 
wavelength is long compared to the spacing between atoms (small a). In the lower case 
« = N/2, and the displacements no longer give the appearance of a smooth sine wave. 


The constants aj, determined in the last equation are complex. 
They could be made real by combining solutions for a and —a (or a and 
N — a). However, it is more convenient to leave them in the com- 
plex form. It is convenient to consider both positive and negative 
values of a, however, and so if N is odd, for example, to consider the 
range of a to be —14(N — 1) to +14(N — 1) rather than 0 to N — 1. 

The relative displacements of the atoms in the string depend on 
the size of a. The situation for two values of a, one with a small and 
the other with a = N/2, is shown in Fig. 8-3. 

Although the comparative magnitudes of the various constants 
مزه‎ are determined by Eq. (8-74), the general magnitude, determined 
by the constants A, is still arbitrary. We establish this with a normal- 
izing equation analogous to Eq. (8-48). Thus choose A so that 


N 
2 Q alig = bap (8-75) 
j= 


which implies that 


ae <= (8-76) 


We are now in a position to represent the various modes with their 
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normal coordinates as 
N N 5 
0. = Y iets = D و‎ etre” (8-77) 


N 
where q; = 3 Caljae at, in analogy with Eq. (8-42). These coordi- 
a=1 


nates are also complex, but we can ensure that the lagrangian derived 
from them is real by writing it in the following way: 


N 
L= 5) O20. — 009202) (8-78) 

a=1 
Perhaps this use of complex coordinates Q needs a word of explana- 
tion. Since ,رو‎ the physical coordinates, are real, Eq. (8-77) implies 
Q* = Q_. so that, although two real numbers are really required to 
specify each complex coordinate Qa, only N independent real numbers 
are needed for all of them. If one prefers real coordinates, one 
can define instead two real quantities as coordinates by writing 


$ 1 
Qa = (Qa ت‎ 20.0 75 
vi (Qa + Q-a) (8-79) 


On = WA (Qa I 02 (8-80) 


but the values of a now run the range only from 0 to N — 1. Thena 
term such as the kinetic energy is expressed in real variables as 


Ila)? + (Qa) = QaQ-a = 2 (8-81) 
[The factor 14 reappears in Eq. (8-78) because we sum there over all a, 
plus and minus, counting thereby each term twice, Q*,Q-« = QaQz.] 
Thus AAs expressions derived previously for real quantities 


appear now as products of one complex number by its conjugate 
[for example, Eq. (8-75)]. 


Problem 8-3 Show that Q.°, Qa.“ are normal coordinates, represent- 
ing, however, standing waves +/2 cos (27aj/N) and +/2 sin (2raj/N), 
respectively, i.e., 


34(N—1) La 44(N—1) 2 j 
ns 8 i 2 = 
g= 2 Qa? V3 cos == + 2 Qa’ V3 sin N (8-82) 


for N odd. 
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Problem 8-4 Show that the ground-state wave function for the 
lagrangian of Eq. (8-78) can be written 


N * 
2 (- ; y ¢) (8-83) 
a=0 S 


(where A is a constant) by starting with the wave function in terms of 
the real variables Q«° and 0. 


Problem 8-5 A transition element which employs the same wave 
function as both the initial and final states is called an expectation 
value." Thus the expectation value of F for the ground state © of Eq. 
(8-83) is 


(So|F |b) = f ٠ ٠ ° SEFF dQo dQ ° ° ١ dQ (8-84) 
Show that the following expectation values are correct: 


(Po|Qa|Po) = (#o| QZ |ho) = 0 


(D| Qa. |B) = (Ho|Q2?|ho) = 0 
1 (8-85) 
($o|Q2Qa|ho) = 5 (%o|1|#0) 


($o|Q2Q,|ho) = 0 if a ¥ 8 


Thus with the lagrangian written in terms of the normal coordinates 
we have reduced the system to a set of independent simple harmonic 
oscillators. The quantum-mechanical part of the solution follows in a 
straightforward manner just as it did for the case of the polyatomic 
molecule. All that we need to know is the quantum-mechanical solu- 
tion for a simple independent harmonic oscillator. 


Problem 8-6 Show that the constants aj, are the same even if 
the coupling is not just to the nearest neighbors but extends with 
strengths A, to an atom k spaces away. Assuming à+ falls rapidly 
enough for large k, find the values of the frequency wa when such a 
coupling is present, i.e., when the potential energy, instead of being 
given by Eq. (8-66), is given by a similar equation, but one which con- 
tains the relative displacements of all pairs of atoms, each one multi- 


plied by the appropriate ^+, that is, V = (v?/2) 0) 3 Xe (Qur; — Q)’. 
k j 


1 Compare this definition of ezpectation value with the definition of the expected 
value of an operator given in Sec. 5-3, particularly in Eq. (5-46). 
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8-5 THE APPROXIMATION OF CONTINUITY 


The particular modes which we have determined here are those in 
which each atom oscillates with a phase difference behind the one next 
in line. There is a wave of oscillation passing down the line of atoms. 
If the phase difference between adjacent atoms is small, then the 
wavelength is long. 

Of special interest is the behavior of the atoms in the long-wave- 
length modes. If the wavelength greatly exceeds the spacing between 
the atoms, this spacing is unimportant. In this case the motion can 
be very well described by the fictitious “continuous medium” concept. 
A line of atoms can be replaced by a continuous rod with certain 
average properties, such as the mass per unit length p = 1/d (recall 
that we have assigned unit mass to each atom). More physically, a 
real rod is actually a discrete set of atoms. In this section we shall 
develop the approximation of continuity, wherein a line of atoms is 
replaced by a continuous string. 

For a particular mode of motion the phase difference between 
adjacent atoms is 27a/N, so that a wavelength contains N/a atoms, or 
if d is the equilibrium separation distance between neighboring atoms, 
the wavelength is A = Nd/a. The wave number is 


2r 27a 

k= ا‎ (8-86) 

The wave aspect is made more clear in the mathematical representa- 
tion of the motion by a slight change of notation. We shall refer to 
each mode by its k value instead of by its a value. Then a summation 
a over the modes means a sum over discrete values of k. These 
values are the integers multiplied by 2r/L, where L = Nd is the length 
of the string. Suppose x; = jd is the equilibrium position of the jth 
atom. Then the equations describing the motion of the atom become 


air = م‎ )8-87( 
0 

Qi = e q eikx, (8-88) 
TR 4 
pak 

u= ae), Mem (8-89) 
k=1 
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and 
wk = 2 sin = (8-90) 
We now assume that the separation between atoms is very small 
compared to the length over which disturbances change. Using the 
symbols we have already defined, such situations as this are described 
by kd > 1. If we call the product vd = c, then for kd small we have 
wœ ke. In this situation we can think of the coordinates رو‎ as being 
functions of position along the line of atoms. That is, we can specify 
the displacement of the jth atom, as shown in Fig. 8-3. For long 
waves the displacements q(x;) and q(x; + 1) are nearly equal, and we 
can consider the function q(x) as a smooth continuous function defin- 
ing displacement as a function of position along the line. The normal 
coordinate Q+ is a Fourier transform of q(x). That is, Eq. (8-88) can 
be replaced by 


oa) = Û [* عمو رون‎ ae (8-91) 


This replacement is based upon the approximate relation 


1 eee fan) ee (8-92) 
A L Jo 


which becomes more valid as the spacing between discrete points 
becomes very small. 
A similar relation, namely, 


`u L f2r/a 
$ (a~ fe” ( )dk (8-93) 
k=1 


leads to the inverse transform 


L 2r/d عت‎ 5 
(2) = جل‎ fo Whe dk (8-94) 


To make these quantities of more direct physical significance, let 
the actual displacement of the jth atom be u;. That is, رو‎ = NVM Uj; 
where m is the mass of one atom and is equal to pd. Let the Fourier 
transform of u be U. Thus 


U(k) = 1 u(a)e*= dz (8-95) 
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while the inverse transform is 


1 بها‎ —ikx Re 
ulz) = z- | Ue dk (8-96) 
The new normal coordinate is then U(k), and it is related to the 
previous normal coordinate Q(k) by 


/ mL 
(k) JN Q(k) 
The expression for the kinetic energy in terms of u(x,t) can be worked 
out with the help of Eq. (8-92) to be 


1 du \? 
KE. = 5 it o (2) dz (8-98) 


To determine the potential energy in terms of all the new variables, we 
need to express the difference in the displacements of two adjacent 
atoms as a continuous function of position. Using our approximation 
of continuity, we can write 


an - رو‎ = VT lules) - ulad] ~d Vm E (8-99) 


That means that the potential energy is 


vd? m fL fdou\? 262 م‎ OUN 
In the last equation we have used the constant c = vd. This con- 
stant is actually a measure of the elasticity. We can define it physi- 
cally in the following manner: Suppose we stretch the line of atoms, 
which has length L, by a fractional increase of amount e, that is, to 
the new length L(1 + <). (We are considering a static stretch, not a 
vibration.) This means that we make the separation between each 


pair of atoms equal to d(1 + e) instead of d. Thus the difference in 
displacements of adjacent atoms becomes 


g1 — G = ed 01 (8-101) 


Using Eq. (8-66), this means potential energy put into the string by the 
stretching is 


2 2 
Vie g dmN = 5 en (8-102) 


Thus the force that is needed to stretch the string is, in the 
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limit of small e, 
op 
EF PCE (8-103) 


This last equation gives the stress in the string, and the strain (stretch 
per unit length) is, of course, e. Thus we have 


Stress 
Strain 


= pc” = elastic constant (8-104) 


Combining Eqs. (8-98) and (8-100), we can construct the lagrangian 
as 


AP A pe du\? 
=5 (3) dx — © | (5%) dx (8-105) 


The fundamental modes which we are considering have the form 
e*z, and the normal coordinates are U(k,t). The reader can show that 
the lagrangian can be expressed in terms of these normal coordinates as 


L=£f | 2-4 e ات‎ k?U?(k, ne (8-106) 


ot 


We can consider the system described by this lagrangian as a set of 
harmonic oscillators, one oscillator for each value of k. In our present 
approximation of continuity, k is a continuous variable with an infinite 
number of values. We can reintroduce the picture of discrete atoms 
by remembering that the integral over dk is really a sum over discrete 
values of k, where the various discrete values of k are spaced a distance 
2r/L apart, with L the length of the string, and the number of such 
values is equal to the number of atoms in the string. 

We can get equations of motion in terms of the continuous variables 


T 
by finding the extremum of the action integral k Ldt. Using the 
form of L given by Eq. (8-105), the resulting equation of motion is 


0u 2 O7u 


Following a line of argument demonstrated by Eq. (8-99), we can see 
that this equation of motion is analogous to the previous equation of 
motion which we derived, namely, Eq. (8-68). Equation (8-107) has 
the solution 


u = e—wta(z) (8-108) 


in analogy with Eq. (8-71), where 


ee 6) (8-109) 
dx 

in analogy with Eq. (8-70), and 

a(x) = e* . )8-110( 


in analogy with Eq. (8-74). 

Combining Eqs. (8-109) and (8-110), we see that w = ke. This is 
the analogue of Eq. (8-90), and, as a matter of fact, in the limit of 
small k, Eq. (8-90) reduces to this relation. 

The motion described by Eq. (8-108), with the value of a given by 
Eq. (8- 110), is that of the traveling wave moving with velocity c. 
That is to say, c is the speed of sound along the line of atoms. Actu- 
ally, a real system shows dispersion; that is, w is not proportional to k. 
For wavelengths which are of the same ee as the atomic spacing 
this lack of proportionality becomes important, as shown by Eq. (8-90). 


QUANTUM MECHANICS OF A LINE OF ATOMS 


The behavior of the atoms in a string can be described in terms of 
modes of motion. Each mode is a harmonic oscillator. The energy 
state of any particular mode is determined by the quantum number for 
that mode. Each mode is identified by its wave number k or its fre- 
quency w. A mode of frequency w can have the energy values hiw/2, 


Bhw/2, 5tiw/2,..., or 0, hw, 2hw, . . . , above the ground state 
energy of fiw/2. For these cases we would say that there are 0, 1, 
2, . . . phonons of wave number k (or frequency w) present. 


It is possible to have several different modes excited simultaneously. 
For example, we could have (1) the mode of wave number k, excited to 
its first level above the ground state, (2) the mode of wave number ke 
excited to its first level also, and (3) the mode of wave number ks 
excited to its second state above the ground level. The state of the 
complete system would then have the total energy h(wi + wa + 2ws) 
above the ground state. We would say that there are four phonons 
present, one phonon of wave number fi, one of wave number kz, and 
two of wave number ks. 

The ground state of the entire system has the energy 


Es yee (8-111) 
k 
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Using the approximation of continuity and letting w = kc, this 
becomes 
Lf kx NEC 
Emi = 5 k -> dk (8-112) 


We notice that if the upper limit kmax on the integral over k goes to 
infinity, then the integral diverges. However, the form w = ke 
which we have used in this expression is valid only for long waves 
(i.e., small values of k). 

We can make a better determination of the ground-state energy 
by using the correct expression for w and establishing a reasonable 
upper limit for the integral over k. Thus, using Eq. (8-90) for Wk, 
we can write the ground-state energy as 


k=kmax 
akd 
End z” sin a (8-113) 
k= — kmax 
where 
2 
2 (8-114) 
This can be rewritten as 
n=N/2 N/2 
Ay | sin T | = 2hv(LP.) > girnlN )8-115( 
N 
n=—WN/2 n=0 
For a very large N this result can be approximated by 
1 
Bona = 2ħcL و‎ (8-116) 


The results show that the energy is proportional to the length of 
the string, but apparently the energy has no limit as the spacing d 
approaches zero. That is, the ground-state energy is infinite for a 
continuous medium. Of course, for real matter the energy is finite. 

It is very convenient to measure, instead of the total energy, the 
excess energy above the ground state. There are two reasons for this: 
(1) Really, the ground-state energy is not known, nor is it usually 
interesting to the physical problem in question. For example, the 
true ground-state energy includes all of the energy of the electrons 
attached to the atoms. (2) When dealing with the excitation of only 
long waves, the approximation of continuity is very useful, and it gives 
a good approximation to the excitation energies. However, this 
approximation gives an invalid result for the ground-state energy, 
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since it neglects the separation d (i.e., treats das 0). Thus, we must 
avoid the necessity of evaluating the ground-state energy if we are to 
use the approximation of continuity. 


THE THREE - DIMENSIONAL CRYSTAL 


There is no difference in principle between a realistic three-dimensional 
crystal and the one-dimensional example which we have been consider- 
ing. However, the detailed evaluation of the various modal fre- 
quencies is much harder. Results can be obtained ‘in terms of the 
wave number k, which is now a vector with the components kz, ky, and 
k.. The frequency, written in terms of these components, is generally 
very complicated. There is more than one solution for each value of 
k because of the possibility of various polarizations (directions of 
vibration). Furthermore, a real crystal often consists not of an array 
of atoms equally spaced, but rather of an array of unit cells, each unit 
cell consisting of a group of atoms in some characteristic geometrical 
arrangement. If there are several atoms (say p) in such a unit cell 
(and this example can be illustrated with a one-dimensional approxi- 
mation), then there are 3p frequencies for each value of k. 

In the three-dimensional crystal we can still use the approximation of 
continuity to good advantage. In this approximation the true lattice 
structure of the crystal generally makes itself felt through the exist- 
ence of different properties in different directions (e.g., anisotropic com- 
pressibility). The symmetry of the lattice is reflected by the sym- 
metry of elastic constants. Furthermore, the fundamental modes 
have vibration directions (polarization directions) which are not neces- 
sarily either parallel to or perpendicular to the direction of propagation 
of the wave. 

For the present discussion, we shall assume that our substance 
displays the same elastic constants in all directions. (In general, it is 
not necessary for any crystal, even one as symmetric as a cubic 
crystal, to do this.) Then the waves are of two kinds, longitudinal 
and transverse. These two kinds of waves have different wave 
velocities, which we shall label cz for the longitudinal and cr for the 
transverse. For each k there are three modes. One of these has the 
frequency wz = czk (where k is the absolute magnitude of k). Since, 
by hypothesis, there is no directional effect, the frequency is a func- 
tion only of the absolute magnitude of the wave number and does not 
depend upon its specific components. There are two transverse 
modes (i.e., modes in which the direction of motion of the atoms is 
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perpendicular to the direction of motion of the wave), both of which 
have the frequency wr = crk. 

Every separate mode, and that includes every separate direction of 
polarization, behaves like an independent oscillator. 

Suppose we are dealing with the crystal of volume V. Let us com- 
pute the number of modes whose wave numbers lie in the k-space 
volume element dk = dk, dk, dk, centered about the point k. We 
assume the crystal is rectangular with edge lengths Le, Ly, and L,. We 
use the results obtained from the one-dimensional example to see 
that the discrete values of k+ are spaced apart a distance 2r/L,. So, 
in the range of length dk, there are dk,L,/2m discrete values of ke. 
Applying this same reasoning to the other directions, we find that the 
number of discrete values of k included in the interval is 


dk, dk, dk, atk 

ay elle = GeV د‎ 
This same result is obtained (in the limit of large crystals) for any 
shape. 


For the general case the modal frequency wx is, as we have men- 
tioned, a very complicated function of k with several branches (values 
for the same k), but its determination is a problem of classical physics; 
then the forms of oscillation in the fundamental modes are known, as 
are the normal coordinates describing these modes. The quantum- 
mechanical problem is then reduced to the solution of a simple set of 
oscillators, and all the properties can be worked out easily. The 
excitation of each mode is called the excitation of a phonon. 

As a very simple special example, we shall consider the longitudinal 
modes of oscillation in an isotropic solid (i.e., sound or, in particular, 
longitudinal sound). We can start as we did in the one-dimensional 
example with the atoms in the crystal discretely spaced and later pass 
to the long-wavelength limit, or approximation of continuity. 

A complete solution would show us all the effects of dispersion, the 
complex branches, and the transverse waves. It is a very interesting 
study. However, one need not carry out all of the steps in order to 
obtain the proper quantum-mechanical form of the continuity approxi- 
mation. One can make use directly of the results of classical physics. 
The entire procedure, starting with discretely spaced point masses, 
then passing to the long-wavelength limit, is just as useful and just 
as valid in quantum mechanics as it is in classical physics. The 
lagrangian has the same form so long as one restriction is imposed, 
i.e., that the potential can be adequately represented by a quadratic 
function of the displacement. The reason for the similarity between 


the results of the classical and quantum-mechanical approach is 
that the procedure consists only of various linear transformations, 
e.g., transforming to normal coordinates followed by certain approxi- 
mations, such as the approximation of continuity. These trans- 
formations and approximations can be done in quantum mechanics 
just exactly as they are done in classical physics. 

The equations derived from classical physics are as follows. Sup- 
pose u(r,t) represents the displacement of a particle whose equilibrium 
position is at r. We assume that we are working in the long-wave- 
length region, so that the approximation of continuity applies. A 
plane-wave mode is easiest to describe in terms of the Fourier trans- 
form given by 


U(k,t) = iL u(r,f)e** dîr (8-118) 


where r is a spatial vector having the components 7, y, 2. The normal 
coordinates of the various modes depend on the relationship between 
the direction of U and the direction of the vector k. That is, the 
coordinate U,(k,t) of the vector U does not necessarily represent a 
normal mode. For an isotropic material the three modes of a given 
k have the following normal coordinates: 


Usk) = =F (8-119) 


(that is, the component of U in the direction of k) and 


U2(k,t) = eı U (8-120) 
U3(k,t) = e U (8-121) 


where رع‎ and ex are two unit vectors perpendicular to k and perpendicu- 
lar to each other. For the present study we shall restrict our attention 
to just that part of the kinetic and potential energy which arises from 
the longitudinal modes given by Eq. (8-119) and omit the transverse 
oscillations. 

Using the results of classical physics the lagrangian for the longi- 
tudinal modes can be written as 


L=£ fff {| eae) i(k, اك‎ — ck[U ı(k, 0)]? lay om (8-122) 


Here we have introduced the speed of sound c = w/k, which is a func- 
tion of the direction of propagation. This is a direct generalization 
from the one-dimensional example. In terms of the original variables 
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u(r,f) the lagrangian is 


E 


The first term on the left-hand side of this equation is the kinetic 
energy, given by one-half the mass times the square of the velocity. 
The second term is the energy of compression given by V ٠ u, which is 
the compressional strain. No energy of sheer strain is included here 
because we have disregarded transverse elastic waves. 

Variation of the lagrangian with respect to u produces the classical 
equations of motion as 


ae c? (y. J dîr (8-123) 


ee = —Vv(V-u) (8-124) 
If we define a compressional strain function equal to the divergence of 
u, that is, 

p= V ‘u (8-125) 
we have the result 

oe ekg (8-126) 


which is the classical wave equation. 
The Fourier transform of Eq. (8-124), using the kernel exp (ik ٠ r) 
and taking the component of the result parallel to k, gives 
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aes RU, (8-127) 


This is the equation of a single harmonic oscillator, and it shows us 
that U,(k,t) is indeed a normal coordinate. 

The quantum-mechanical results from the lagrangian given by Eq. 
(8-123) can be obtained easily. The energy levels of the mode in 
question are given by nfi(kc) above the ground state. Let us ask for 
the amplitude to go from a fixed initial set of coordinates u(r,0) to a 
given final set of coordinates u(r,7). It is 


Klu(r,T),T ;u(r,0),0] = 
CE we dîr |) D*u(r,t) 


0 (ep = = R p Ie iB Ot (8-128) 


The path integral of Eq. (8-128) is carried out over the paths u(x,t) 
defined in terms of all three components of the vector r, as well as the 
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time t. It is subject, of course, to the condition that the function 
u(r,f) take on a given form at both the initial and final points. This 
is an interesting extension of our original path integral idea. Up to 
now we have dealt with integrands which were functionals of one (or 
perhaps a few) function z(t) of one variable t, and we have carried out 
the integration over all such paths, or functions. Now we must 
integrate a functional of the function u(r,t) of four variables 7, y, z, 
and t and carry out the path integration over all values of this function. 
We can accomplish this by the regular techniques which we have 
described before, for our integrand is still a gaussian functional. 

The first step in the solution of the path integral is to find the path 
which leads to a stationary value for the integral appearing in the 
exponent, the one which satisfies Eq. (8-123), or, more conveniently, 
the wave equation given by Eq. (8-126). We must impose the 
required boundary conditions at the times t = 0 andt = T. Satis- 
fying the boundary conditions is not a difficult problem; however, it is 
a little different from the usual problem in classical physics in which 
the coordinate and its derivative are given at t = 0, that is, u(r,0) 
and (du/dt):.0. 

We could proceed along this line and solve the problem. However, 
we have learned from previous examples that it is much easier to 
transform the problem into normal coordinates before carrying out the 
path integral. Such a transformation gives us 
Mae {exp be 7 / if (Uy? — kU) a|} DU,(k,i) (8-129) 


KE 


where the boundary values are given by 


k 
= = — o ik-r 

Ui(T) = Ux(k,T) = 7 i 1 / eu (r,T) dîr 

)8-130( 
U:(0) = Ui(k,0) = ٠ f 1 Í ety (1,0) dîr 
This is once more the simpler type of path integral, where the path is 
described in terms of only the one variable t. Since the path integral 
can be written as a product of path integrals, each one defining the 


motion of a normal mode, we find that we have already solved the 
problem. The result is [cf. Eq. (8-10)] 


2 pkc % ipke 
K 5 Il 6 sin FT) exp (x sin keT | U;*(k,T) U:4060) | 


x cos kcT — 2U,(k,T) 0)) ) (8-131) 
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In the products over the components of k, the نه‎ component, for 
example, takes on the values 2rn,z/L, where n, is an integer running 
from 0 to N = L,/d. Here d is the spacing between atoms, and we 
have assumed that the solid we are studying has edge lengths Lz, 
Ly, and L,. Of course, the approximation of continuity 56 
a zero spacing between the atoms, which means that the product is 
unbounded. However, we shall disregard such problems and con- 
centrate only on the form of those terms showing dependence on 
initial and final coordinates. Thus, disregarding the radical which 
multiplies the exponential term of Eq. (8-131), we can write this 
equation approximately as 


K ~ exp 0 f / f kc{[U:?(k,T) + U12(k,0)] cos keT 


dk 
(2r) 


The dependence of the amplitude on the boundary values U,(k,0) 
and U,(k,T) is contained in this last result. For any choice of these 
functions [and they, in turn, depend on u(r,0) and u(r,T), as shown by 
Eqs. (8-130)] the integration in Eq. (8-132) can be carried out, formally, 
and a final answer obtained. In this manner all questions about the 
quantum-mechanical behavior of the system can be answered, at 
least in principle. 


— 2U i(k, T)Ux(k,0)} (sin keT)— (8-132) 


QUANTUM FIELD THEORY 


Suppose we are dealing with waves or modes which are described by 
continuous functions, like u(r,t), for which there is no atomic sub- 
structure or for which the wavelengths are long enough that we can 
neglect such a substructure. In this case we say that u(r,t) is a field, 
i.e., a function of each point in space. In the example we have just 
considered the field is the displacement field of sound. In this ter- 
minology the equations of motion are called the field equations. In 
the present chapter we have been dealing only with linear field equa- 
tions. The lagrangians can be called the lagrangians for the field. 
The normal coordinates U(k,t) are the coordinates for the normal 
modes of the field. The description of these modes as quantum 
oscillators is called quantizing the field. The resultant theory is called 
quantum field theory, to distinguish it from the classical analysis of 
the equations. 

As we have seen, almost all of the effort in quantum field theory 
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is devoted to solving the classical equations of motion to find the nor- 
mal modes, an activity completely within the realm of classical physics. 
The “quantization” consists then of no more than the additional 
remark that each of the normal modes is a quantum oscillator, with 
the energy levels iw(n + 14). Presented in this way, quantum field 
theory seems to be just a special consequence of the Schrédinger 
equation, and not an extra theory at all. 

That is, or should be, the case for any situation in which the field 
variables (like sound displacement or pressure) are defined ultimately 
in terms of some combination of the basic mechanical variables. 
These basic variables describe the position of the particles, atoms, 
electrons, and also nuclei, which comprise the material carrying the 
field. For example, in the case of sound we assume that Schrédinger’s 
equation describes the motion of the constituent parts, or atoms, in a 
crystal. Then we easily deduce that the long-wavelength sound 
waves obey the classical linear field equations, and we find that the 
modes are quantized. 

In a few cases the classical equations of some field pertaining to a 
system are known, even though the quantum-mechanical derivation 
starting from Schrédinger’s equation has not yet been made. For 
example, the equations describing the oscillations of a drop of nuclear 
matter have been guessed by classical analogy.! In such a situation it 
is an excellent guess that the modes of the field will turn out to be 
quantized oscillators if and when the complete quantum-mechanical 
derivation is worked out. Actually, not many such examples are 
left. Nearly all cases have by now been worked out. 

Another type of field equation, fundamentally different from that 
described above, exists in quantum mechanics. An example of this 
type is Maxwell’s set of electromagnetic equations, a set of linear 
field equations. These equations lead to a wave equation which 
is analogous to the one we developed for sound, although there are 
different polarization conditions. Just as an organ pipe has stand- 
ing waves, or modes, so an electromagnetic field in a cavity can be 
described classically in terms of fundamental modes of oscillation. 
It is a natural inference that these oscillations are also quantized in 
the sense that each mode can have the energy levels Awn above the 
ground state, etc. This is the fundamental assumption of the quan- 
tum theory of electromagnetism. It is not a strict deduction from the 
Schrédinger equation for matter, because the electromagnetic field 
is not understood as a long-wavelength approximation of an atomic 
medium. Today, we do not think of any particular medium, but take 


1M. 58. Plesset, On the Classical Model of Nuclear Fission, Am, J. Phys., vol. 9, 
no. 1, pp. 1-10, 1941, 
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the equations of Maxwell for granted. We simply assume they are 
to be quantized in the simple direct manner described above. We 
shall discuss this example in more detail in Chap. 9. 

The assumption of quantization for the electromagnetic fields turns 
out to be consistent with all experiments carried out so far, although 
there are some theoretical difficulties. These difficulties are associated 
with the extension of the scheme to modes of very short wavelengths. 
There are various effects which lead to diverging integrals if the inte- 
grations are carried toward zero wavelength. The corresponding 
difficulties do not really arise in a vibrating crystal because if we wish 
to carry the analysis into the very short wavelength region, where the 
wavelengths are comparable to the atomic spacing, we must drop the 
approximation of continuity. Then in the case of a crystal we find 
that there are only a finite number of modes in any finite volume, while 
in electrodynamics the number of modes in any volume is infinite. 

When the various modes of a field are excited, we say there are 
“things” present which have different names for different cases. For 
sound or crystal vibrations we call them phonons, for the electromag- 
netic field photons, for meson field theory mesons, etc. Even electrons 
can be represented as being excitations of a field, but it is a field of a 
very different kind from what we have been discussing. It is called a 
Fermi field; the particles obey the exclusion principle, and the lagran- 
gian is quantized not by representing it as a set of harmonic oscillators, 
but in a different way. Fields quantized as modes of harmonic oscil- 
lators are called Bose particles; they obey Bose or symmetric statistics. 
This just means that if one has two particles, one of wave number Kk, 
and one of kz, there is only one state. There is no new state where the 
first has kz and the second has kı. This is because for our field there is 
only one state with kı and kz each excited to the first level. It has 
energy fiw: + ws, and it is meaningless to ask: After an exchange, 
which excitation is which? In the next chapter we discuss this in 
more detail for the case of photons of the electromagnetic field. 


Problem 8-7 It is believed that neutral particles of spin zero (like 
neutral pions) can, when free, be represented by a field ¢ with a 
lagrangian 


1 8 2 204 
L= Is 8 ال‎ Mae 3 dîr dt (8-133) 


where بر‎ is some constant. Show that this field has quantized states 
corresponding to waves exp (7k+r), where the energy of excitation is 


ho = Vk? + pct (8-134) 
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If hk = pis considered as the momentum of each excitation the energy 
is 


E = Vp? + ute! (8-135) 


This is the relativistic formula for the energy of a particle of momen- 
tum p and mass .بر‎ (Note: For p? small it is approximately 


2 
E 


the rest energy uc? plus the kinetic energy p?/2u.) 

We interpret the state of the field when the mode k; is excited to the 
second quantum level, kz to the first, etc., as the state of a system 
containing two particles of momentum ñik, one with fk», etc. The 
ground state is considered the state in which no particles are present, 
and it is called the vacuum state. Excitation or deexcitation of the 
field oscillators corresponds to creation or annihilation of particles, 
and this is the way that such processes are represented in relativistic 
quantum field theory. 


THE FORCED HARMONIC OSCILLATOR 


In this chapter we have dealt with the simple oscillator or with systems 
that could be reduced to a set of such oscillators. But the oscillators 
have been free, not interacting with anything else. We must develop 
our analysis further if we wish to deal with such linear systems in 
interaction with other systems or driven by external forces. Exam- 
ples of such systems include polyatomic molecules in varying external 
fields, colliding polyatomic molecules, crystals through which an 
electron is passing and exciting the oscillator modes, and other inter- 
actions of the modes with external fields. We shall not discuss the 
problem of interaction in general; instead, we use as a prototype the 
example of the interaction of atomic systems and charges with the 
electromagnetic field. We do this in the next chapter. Other cases 
may be analyzed by direct analogy. 

These problems involve two aspects: (1) the resolution of the field 
into its component independent oscillators and (2) the interaction of 
each oscillator with external potentials or other systems. The resolu- 
tion into oscillators has been exhaustively studied so far in this chapter. 

To prepare the complete machinery for such problems, it remains 
only to analyze the behavior of a single oscillator disturbed by an 
external potential. We shall put these pieces together in the next 
chapter. 
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In this section we go back to the study of a single harmonic oscillator, 
but coupled linearly to some external potential or disturbance. The 
lagrangian for such a system is given by 

2 

f= va — =o x? — y(t)r (8-136) 
where y(t)'is the external force. We assume for convenience that it is 
turned on only during a certain time interval T from t = 0 tot = T, so 
that the oscillator is free initially at t = 0 and finally att = T. We 
have already completely solved this problem, obtaining the amplitude 
K(b,a) that the oscillator goes from a point 2, at t = 0 toa point x, at 
t= T, in Prob. 3-11. But for the present applications it is con- 
venient to find as well the amplitude G,,, that the oscillator initially in 
state n is found at time T in state m. This representation is often 
more convenient than the coordinate representation. 

In Sec. 8-1 we determined the wave functions n for the free 
harmonic oscillator, and in Prob. 3-11 we evaluated the kernel 
describing the motion of a forced harmonic oscillator. This means 
that we can determine the amplitude Gmn by direct substitution into 


Gan = eman f“, |) on(a)K(un,T te0) bale) dza dts (8-137) 


For the case m = n = 0 this integral is a gaussian somewhat lengthy 
to evaluate but presenting no special problems. The result is 


1 y 
E E seca Ih [i Wve» as 4 | (8-138) 


If m and n are not equal to 0, then the integral is somewhat more 
complicated. However, we can use the same sort of trick that we 
used in Prob. 8-1. We shall ask for the amplitude that a forced 
harmonic oscillator goes from the state f to the state g, where these 
two states are defined in Prob. 8-1. This amplitude is 


F(a) = Y Y نس‎ faae Enr 
m=0 n=0 


eo © M 
= À > Grn {exp |- m+ » |} 
m=0 n=0 
a”b™ M N (m+n1/2 ais 3 


where M is the mass of the particle [see Eq. (8-28)]. If we can work 
out F (b,a), we can get Gmn by multiplying F by exp [(Mw/4h) (a? + 6?)] 
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and developing the resulting expression in a power series in a and b. 
That is, we want first to solve 


F(b,a) z ies E e~ (Mol 28) (2° K (£9, T 21,0) ee e—a" 021 dz: 

í (8-140) 
where K(z:T;xı0) is the kernel for a forced harmonic oscillator 
[Eq. (3-66)]. The variables appear only quadratically in the exponent 
of this integrand, so that the integration can be performed easily. 


Some of the resulting algebra is a little lengthy; however, eventually 
one finds 


F(b,a) = | exp |- se (a? +b? — 2abe-#7) + i L (aB + bB*e-#7) 


ag -e if $ ik y(t)y(s)e t» ds a |} e-112 (8-141) 
where 
1 
M ن 2 ليد‎ 
1 
M ن 2 ليد‎ 


The values of Goo can be obtained easily from Eq. (8-141) by setting 
a=b=0. The result is the same as Eq. (8-138). Next we multiply 
by the exponential function, as described earlier, and find, by putting 


1. (Mw a [Mo ar 
co oR 0 y= “og Oe 


that 


f ve at (8-142) 


B= 


p* = 


f ver at (8-143) 


2 0 Gan AT = [exp (xy + 78x + iß*y)]Goo (8-144) 
m=0 n=0 7 Š 


By expanding the right-hand side in powers of ند‎ and y and comparing 
terms, we obtain the final result: 


Gmn 1GB) (GB (8-145) 


_ _ Goo m! n! 
Vmin! X (m — r)!r! (n — r)!r! 


where l = m or n, whichever is the larger. 
This completely solves the problem of a forced harmonic oscillator. 
We shall discuss it further and make use of it in the next chapter. 
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In this chapter we shall discuss the interaction between charged 
particles and an electromagnetic field. We have seen one example of 
such an interaction in Sec. 7-6, where the electromagnetic field varia- 
bles entered into the potential term of the lagrangian. The electro- 
magnetic term introduced in that section is the vector potential A. 
Section 7-6 deals only with the-:motion in a definite given field. It 
does not tell us anything about how the field A arises or how it is 
affected by the moving particles. That is, the formulation of the 
problem does not contain any analysis of the dynamics of the field. 
Such an approach, using given potentials, is only an approximation. 
It is valid when these potentials arise from such large pieces of appara- 
tus that the motion of the particle does not affect the potential. 

In this chapter we shall be concerned not only with the way in 
which potentials affect the motion of the particle, but also with the 
way in which the particle affects the potentials. We shall start with 
the classical approach and use Maxwell’s equations to describe the 
dynamics of the electromagnetic field. These equations express the 
field in terms of the charge density and current of the matter present. 

We have found in preceding chapters that the quantum-mechanical 
laws which correspond to some classical system can be easily deter- 
mined if only we can express the classical laws in the form of a least- 
action principle. Thus we have found that if the extremum of some 
action S, varied with respect to some variable g, corresponds to the 
classical equation of motion, then the quantum-mechanical laws are 
expressed as follows: The quantum-mechanical amplitude for any 
given situation, corresponding to the action S, is the path integral of 
e‘S/* integrated over all possible paths of the variable و‎ which fit the 
conditions of the situation. 

It is vital to our present approach that classical electrodynamics, 
as expressed by Maxwell’s equations, can be written as a principle of 
least action. An action S exists which can be expressed in terms of the 
vector and scalar potentials A and ¢. The determination of an 
extremum for this action, by variation of the field variables (r,t) and 
A(r,t), leads to a formulation of electrodynamics equivalent to Max- 
well’s equations. Hence, quantum electrodynamics results from the 
rule that the amplitude for an event is 


K(2:1) = if * eM SIA.9] DA(r,) 54), (9-1) 


where the path integral is over all values of A and ¢ at each point of 
space and time, subject to the boundary conditions at the initial and 
final points of the event. 
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9-1 CLASSICAL ELECTRODYNAMICS 


Maxwell’s Equations. We shall begin our study of electro- 
dynamics from the customary classical fundamentals, i.e., from Max- 
well’s equations. We shall assume the magnetic permeability and 
dielectric constant are those for free space. Then, with E as the elec- 
tric field vector, B as the magnetic field vector, c as the speed of light, 
j as the current density, and p as the charge density, Maxwell’s 
equations are 


V.E = م47‎ (9-2) 
1 /dE : 
9 ١.8 = (9-4) 
1 0B 
VxE=-—- AT (9-5) 


These equations make sense only if charge is conserved, that is, 


POP X 
VAR eer (9-6) 
Equation (9-4) implies that B is the curl of some vector A: 
B=VxA (9-7) 


This relation does not fully determine A; we still may specify its diver- 
gence. We choose 


v-A=0 (9-8) 


This choice is not recommended if it is desired to keep the full relativis- 
tic four-dimensional symmetry of the equations in evidence. [It is 
not that the results using Eq. (9-8) are not relativistically invariant; 
for the results are independent of the choice of V- A. It is, rather, 
that the invariance does not appear obvious at first glance.] In our 
case we shall deal with matter in the nonrelativistic approximation 
anyway (for we do not have a simple path integral for the Dirac 
equation). We wish to illustrate the properties of the quantized 
electromagnetic field, and the results are least cumbersome with the 
choice of Eq. (9-8). 

Substitution into Eq. (9-5) shows that E + (1/c) dA/dt has zero 
curl, so it must be the gradient of some potential 


Eevee E (9-9) 
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If there is no charge and no current density, the equations are easily 
solved. From Eqs. (9-2), (9-8), and (9-9) we see 


V.E = —V’¢ = 4rp (9-10) 


If م‎ = 0, this means ¢ = 0, and E = —(1/c) dA/dt. In Eq. (9-3) 
with j = 0 this gives [note: V x (V x A) = V(V- A) — V°A] 


1 4 


2 چ‎ aE ميد سمس‎ 
vA c? ot? 


=0 (9-11) 
Thus, each component of A satisfies a wave equation. 
If we expand A in running plane waves, that is, 


A(R,t) = ax(t)e™** (9-12) 


the equation for the amplitude a; is 4, = —c?k?a,, which implies that 
ax is a simple harmonic oscillator with frequency w = ke for each 
component direction of a. Actually there are only two independent 
transverse waves; the component of a, in the direction of k must be 
zero. That is the implication of Eq. (9-8), which can be rewritten as 


k-a, = 0 (9-13) 
Thus the field in free space is equivalent to a set of free harmonic 


oscillators with two transverse waves for each value of k. 


Problem 9-1 Show that E, و8‎ and k are mutually perpendicular 
for this plane-wave solution. 


Solution with Charges and Currents Present. We shall again 


expand the solutions for A, ¢, and the current and charge density in 
plane waves, writing 


A(R, = Våre 1 ar (de® ع‎ 


un dk 
g(R,t) = | xe Ory 
f eee rate ae) 
IRO = fie كس‎ 


A(R) = f oDer oa 


Problem 9-2 Explain why the charge density corresponding to 
a single charge e located at the point q(t) at time t is 


p(z,y,2,t) = ed[z — q-(t)]õðly — qu(d)]élz — [0)ءو‎ = e8 TR — [0)و‎ 
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Show that 


pr = ee tka) (9-15) 


The current density is j(R,t) = eq(#)6°[R — q(t)]. Ifwe have anum- 
ber of charges e; located at q(t), the values of px and jg are 


ee ai ix = >} e{;(t)e ik: )9-16(‏ 2 اوم 


Here Eq. (9-13) still holds, and we may use it to simplify some ex- 
pressions. The coefficient of expansion of B is Bx = ~/4m ci(k X ax); 
of E, it is Ex = —ik¢, — Vår ay. The coefficient of expansion of 
V-Eiszk- Ex = kp, so we have _ 


Kp = 4rpr (9-17) 


or يف‎ = 4rp:/k?. The function ¢; is completely determined in terms 
of the charge density زيم‎ there are no dynamic differential equations to 
solve, involving, for example, $x. 


Problem 9-3 Prove that the relation ¢, = 47p;/k? simply means 
that ¢, at any instant is the Coulomb potential from the charges at 
that instant, so that, for example, if p comes from a number of charges 


e; at distances r; from a point, the potential at the point is ¢ = 0 e,/T;. 
This is just the content of Eq. (9-10). 
Equation (9-3) still remains to be solved, that is, 


ik x B, = E = : Aaj, (9-18) 


But ik x B, = — Vår ck x (k xX ax) = V 4m cha and É, = —ikd, 
— vV 4r är, and using Eq. (9-17) to express يف‎ as 4rpx/k?, we get 


i + Hoa, = Vir (ie G) = Vie i (9-19) 


where we can call jj, = jx — ikp;/k? the transverse part of jx. The 
law of conservation of current, expressed by Eq. (9-6), says that 


px = —ik- jx; so 
/ : k({k: j 
foie Se = ja (9-20) 
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which means that ją is jx less its component in the direction k. 
Clearly, k -jį = 0. 

We have certainly reduced Maxwell’s equations to a very simple 
form—aside from the instantaneous Coulomb interaction between 
particles, we have no more than the equations for two transverse 
waves for each value of k, the amplitude of each being a harmonic 
oscillator driven by the component of current in the corresponding 
direction. That is, if we choose two directions perpendicular to k, 
say, 1 and 2, and call the components of ax in these directions a1, and 
aax, Maxwell’s equations are 


k?c?ay, = WV 4F Jox (9-21)‏ + يرون 
ûax + kas = V 4r jox (9-22)‏ 


where jix and jax are the components of jx in these directions. (Why 
do we not need to say “of j?) 


The Least-action Principle. The hypothesis of quantum elec- 
trodynamics! is that the oscillators defined in Eqs. (9-21) and (9-22) 
are quantum oscillators. To carry out the quantization, we must find 
the principle of least action which gives these defining equations of 
motion as well as the equations of motion of the particles in the field. 
The action is 


S = S1 + So + S3 (9-23) 
where 
S= 5 f lalla (9-24) 


1 


is the action of all the particles, disregarding the field (if there are 
nonelectric forces between the particles, they are to be included in S1), 


8: = f [RDR = = i(R,1) A(R) | dR dt 
(9-25) 
= X e f (elat - Fax Alar.) a 


It should be pointed out here that some physicists use the term “quantum 
electrodynamics” to include electron-positron pair theory. In the present chapter 
we do not cover such problems. So for us, quantum electrodynamics means the 
quantum theory of the electromagnetic field. 
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is the action of interaction of field and particles, and 


pees 2_ B)gRaG = wy. 2 OA? 
وق‎ = g- | Œ ردم‎ aR dt = =f || و‎ 


- |7 x al] dR dt (9-26) 


is the action of the field. The variables are A(R,t), ¢$(R,), and 
q:(ċ). 


Problem 9-4 In Sec. 2-1 we discussed the mechanisms for obtain- 
ing the mechanical equations of motion from the form of the action 8 
by obtaining the extremum Sa under the condition 6S = 0 for varia- 
tions of coordinates, ôq. Show how Maxwell’s equations can be 
derived from the action S defined in Eq. (9-23) by requiring 6S = 0 
for first-order variations of A and ¢. 


Since the dynamic equations are simplest in terms of the variables 
ax, it is worthwhile to express the action in these variables. Substitu- 
tion of the expansion given in Eqs. (9-14) into S; gives 


Ok تر‎ NOA dt 
S3 5 =f ( ax + e TE C |k x axl ) (2r)? 
} dk dt 
(wE 1 س‎ at . åk = KCB ° ax) (Qn)! (9-27) 
and §, becomes 
dk dt 
= f (p-xÖx — V 4r j-r + ax) جد‎ r)? (9-28) 
Upon substitution of يرف‎ = 4rp,/k?, the terms in ¢x in Sz and S; add 
to give 
ye Ar PxrP—k ak NE eo 9-29 
pas 2 k? (2r)? D 2 lq: — (9-29) 


using Eqs. (9-16), since f(4r/k?)[exp (ik ° R)] dk = 1/R. This is 
just the Coulomb interaction between the charges, which is usually 
considered in analyzing atoms when electromagnetic radiation effects 
are neglected. That is, we shall include this interaction in the action 
Smat Of the matter, 


= oe ve 9-30 
Sant د‎ a) sii 
and write S = Smat + Sint + Sraa. We have thus divided the actions 
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of the electromagnetic field S; into two parts; one contributes to an 
instantaneous Coulomb interaction, and the remainder we shall call 
the radiation field S,aa. (The radiation field takes care of all correc- 
tions to the instantaneous field, such as, for example, that the total 
effects are retarded and act no faster than the speed of light.) The 
action of the radiation field is S; less the terms involving .يرف‎ That is, 


A er ernst x fot dk dt 
Sraa = 5/ (ûîkûık — kcat + يرون‎ — kc?az,,dox) Or Qr)? (9-31) 


which is just the action of the radiation oscillators. The action of 
interactions of these oscillators with the particles is 


dk dt 
بزوع]‎ 


Clearly, the variation of the total action of S with respect to the aix 
and as gives the equations of motion (9-21) and (9-22). 
Written more explicitly, the action Sinz is 


dk dt 
(2r) 


where qı; and qo; are the components of q; in the direction transverse 
to k. Thus all the laws of nonrelativistic mechanics and of electro- 
dynamics are contained in the proposition that S, the sum of Eqs. 
(9-30), (9-31), and (9-33), is stationary for variations in the paths of 
the variables q;(t), aix(t), and ax(t). Quantum electrodynamics 
results from integrating eS over these paths, and it is described in 
the next section. 


Sins = V 4r if (jı-k@ıx + 0و3‎ ( (9-32) 


(9-33) 


Sint = Vin) 0 (dıdı; 3E 0 كلاج زرو )يرو‎ ® 
7 


THE QUANTUM MECHANICS OF THE RADIATION FIELD 


We begin by discussing the quantum mechanics of the radiation field 
in empty space. There is no matter present, so the total action is that 
of the radiation field along 


S = 8 rad (9-34) 


as given by Eq. (9-31). It is evidently the action of a set of harmonic 
oscillators. We have seen some examples of expressions like Eq. 
(9-31) in Chap. 8. We make the assumption discussed in Chap. 8 
that quantum electrodynamics results from considering these as 
quantum-mechanical oscillators. 

The modes of our system are running waves, two for each value of k 
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(polarization 1 and 2) with frequency w = ke. For one of the modes, 
Say, ix, the available energy levels are 


Eix = (nx + )hke (9-35) 


where nix is any positive integer or zero. 

If mix = 1, we say there is one photon present of polarization 1 and 
momentum ñik; in general, we say nıx such photons are present. The 
energy of a single photon of this kind is 7. 


Problem 9-5 The momentum in the field is given by (1/4rc) 
X JE X B d(vol). In the absence of matter (so ¢x = 0), show this 
is fk(ař - a.) dk/ (2r). 


Later on when we consider the interaction of matter with the radia- 
tion field, we shall find that the matter absorbs or emits one photon at a 
time of energy ñw. Of course, this is the same as Planck’s original 
hypothesis. 

It is quite striking and surprising that the states n of the oscillators 
can also be described by imagining that there are n “particles” or 
“photons” present. It is clear, of course, that the energy values agree. 
But there is one further subtle point that must be noted before the 
oscillator states can be completely successfully described as particles. 
Suppose, for example, that just two of the n; differ from zero, say, 
Na = 1,m=1. This single state we may wish to represent by saying 
that we have one photon in state a and another in state b. But at 
first sight this way of speaking might seem to imply that there were 
two states available, both of the same energy. For we could also 
expect to be able to put the first photon in state b and the second in 
state a. The way out of this can be seen when we consider the exam- 
ple of a particles. Suppose we have two a particles with coordinates 
x and y, and say the z particle is in a state represented by f(x) and 
the y particle is in a state g(y). Thus the wave function for the system 
would be 


¥(x,y) = f(x)gy) (9-36) 


a function of the two variables x, y. But another state might have 
y in the state f and z in g, leading to another state of wave function 


(zy) = g(x)f(y) (9-37) 


which differs from the first. But if the particles are truly identical, 
like a particles, the two states are indistinguishable. As we described 
in Chap. 1, it turns out to be a rule of quantum mechanics (not deriva- 
ble from the Schrödinger equation) that for a particles the amplitudes 
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for two cases which differ only by exchange of the æ particles must 
always be added. The only allowed wave function is in this case 


v(2,y) = f(z)gy + 9@)FY) (9-38) 


(suitably normalized: if f and g are orthonormal, the factor is 1/ له‎ 2: 
if f = g and they are normalized, it is 14). In general ¥(z,y) = W(y,2) 
for a particles and for other particles obeying Bose statistics. There 
is, for such particles, only one state: one particle in state f, the other 
in state g. 

It turns out that all the results are consistent if, when we consider 
oscillator excitation states as representing numbers of photons, we 
also say that photons are Bose particles. Then the single state 
Na = 1, n, = 1 represents the situation that there are two photons, 
one ina, one in b. Exchange does not produce a new state. 

For electrons of parallel spin or other Fermi particles we must 
subtract the amplitudes when the identity of the particles is: reversed 


(zy) = S(x)g(y) — gæfu) (9-39) 


The wave function ¥(2,y) = —¥(y,z) is antisymmetric in general for 
Fermi particles. This is, of course, also just one state. But for 
Fermi particles, two particles cannot occupy the same state. If we 
put f = g into Eq. (9-89), we get zero. Two photons, like two a par- 
ticles, can occupy the same state; for photons it corresponds to the 
n = 2 oscillator states. 

There is one particular situation with matter present which, in the 
ideal case, can be handled nearly as simply as the matter-free case. 
That is the case of a cavity resonator (or a wave guide) where the walls 
may be idealized as perfect conductors. Then classically, as is well 
known, there are a number of possible oscillator modes with more or 
less complicated distributions of electric fields. The classical action is 
then reducible to a set of free oscillators, but the variables now repre- 
sent the amplitudes of the various modes, rather than the amplitudes 
of plane running waves. These oscillators are then analyzed as quan- 
tum oscillators, and we speak of the number of photons in each mode. 


THE GROUND STATE 


Vacuum Energy. The state of the electromagnetic field of lowest 
possible energy, which we shall call the ground state.or the vacuum 
state, is that in which there are no photons in any mode. This 
means that the energy in each mode is fiw/2, where w is the frequency 
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of the mode. Now if we were to sum this ground-state energy over 
all of the infinite number of possible modes of ever-increasing fre- 
quency which exist even for a finite box, the answer would be infinity. 
This is the first symptom of the difficulties which beset quantum 
electrodynamics. 

In the present case, for the vacuum state, the trouble is easily fixed. 
Suppose we choose to measure energy from a different zero point. 
Since there is no physical effect resulting from a constant energy, 
the result of any experiment we perform will be insensitive to the 
arbitrary choice of the zero point in energy. Therefore, we assign to 
the vacuum state the energy zero. Then the total energy in any state 
of the electromagnetic field is given by 


E= > njw; (9-40) 


3 


where the sum is taken over all the modes 7 of the field. 

Unfortunately, it is really not true that the zero point of energy 
can be assigned completely arbitrarily. Energy is equivalent to mass, 
and mass has a gravitational effect. Even light has a gravitational 
effect, for light is deflected by the sun. So, if the law that action 
equals reaction has qualitative validity, then the sun must be attracted 
by the light. This means that a photon of energy fiw has a gravity- 
producing effect, and the question is: Does the ground-state energy 
term ňw/2 also have an effect? The question stated physically is: 
Does a vacuum act like a uniform density of mass in producing a 
gravitational field? 

Since most of the space is a vacuum, any effect of the vacuum-state 
energy of the electromagnetic field would be large. We can estimate 
its magnitude. First, it should be pointed out that some other infini- 
ties occurring in quantum-electrodynamic problems are avoided by a 
particular assumption called the cutoff rule. This rule states that those 
modes having very high frequencies (short wavelength) are to be 
excluded from consideration. The rule is justified on the ground that 
we have no evidence that the laws of electrodynamics are obeyed for 
wavelengths shorter than any which have yet been observed. In 
fact, there is a good reason to believe that the laws cannot be extended 
to the short-wavelength region. Mathematical representations which 
work quite well at longer wavelengths lead to divergences if extended 
into the short-wavelength region. The wavelengths in question are 
of the order of the Compton wavelength of the proton; 1/27 times this 
wavelength is i/Mc ~2 X 10-14 cm. 

For our present estimate suppose we carry out sums over wave 
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numbers only up to the limiting value kmax = Mc/h. Approximating 
the sum over states by an integral, we have, for the vacuum-state 
energy per unit volume, 


a oon he nek ee 
unit vol  2(2r)3 87? 
(Note the first factor 2, for there are two modes for each k.) The 


equivalent mass of this energy is obtained by dividing the result by 
c?, This gives 


i pant Ts Anh dime (9-41) 


g/cm? (9-42)‏ 2.1015 = كك 

Such a mass density would, at first sight at least, be expected to 
produce very large gravitational effects which are not observed. It is 
possible that we are calculating in a naive manner, and, if all of the con- 
sequences of the general theory of relativity (such as the gravitational 
effects produced by the large stresses implied here) were included, the 
effects might cancel out; but nobody has worked all this out. It is 
possible that some cutoff procedure that not only yields a finite energy 
density for the vacuum state but also provides relativistic invariance 
may be found. The implications of such a result are at present 
completely unknown. 

For the present we are safe in assigning the value zero to the vacuum- 
state energy density. Up to the present time no experiments that 
would contradict this assumption have been performed. As we pro- 
gress further into the field of quantum electrodynamics we shall find 
other divergent integrals which are more difficult to circumvent. 


Vacuum Wave Function. The wave function for the set of oscil- 
lators is just the product of the wave functions for each mode. For 
the ground state the wave function of the oscillator 1, k is propor- 
tional to exp [—(ck/2h)/aj,axx], so the wave function for the entire 
system in the ground state, or vacuum state, is, within a normaliza- 
tion constant, 


k 
o = exp | - J $ (afan + afan) | (9-43) 
k 


Problem 9-6 Show, using sine and cosine modes and real variables, 
that this expression using complex variables is indeed correct (cf. 
Prob. 8-4). 


Problem 9-7 Show, for the vacuum state, the mean value of aj,au 
is (i/2kc)8,. Develop a formula for the mean of (a},aix)" for inte- 
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gral r and explain thereby how the mean of such quantities as 


. (af,a1x)"(af,a1p)* can be got for p Æ k. Show that the mean of (aix)? 


or (aì)? vanishes. Show that the mean of the product of any odd 
number of a’s is zero and that you can compute the expectation value 


١ of any product of a’s or a*’s for the vacuum state. 


Problem 9-8 For the state for which there is just one photon 
present in state 1k, all of the factors in the wave function are $o except 
one, which is .رف‎ But for an oscillator ¢1(q) = q%0(9). The wave 
function representing an excited running wave is a linear superposition 
of the state with the cosine mode excited and 7 times the state with the 
sine wave excited, so show that the wave function for just one photon 
present in state 1k is aiko. This is not normalized. The normaliza- 
tion is [®ja1,a},Po, or the expectation of axañ for the vacuum, which 
we have seen in the preceding problem is 7/276. Hence the normal- 
ized one-photon state is ~/2kc/h .ويه‎ 


INTERACTION OF FIELD AND MATTER 


To deal with the interaction of the radiation field with matter is not 
difficult in a formal way. Evidently from the action expression of 
Eqs. (9-30), (9-31), and (9-33) we see we must deal with the matter 
system interacting with the radiation oscillators and must calculate 
amplitudes from 


Amplitude = ri | exp E (Smat + Sing + Su) || I] Dq: Dai, Dak 
ik 
(9-44) 


The coordinates of the radiation oscillators can be integrated out 
immediately; for they appear only in quadratic expressions. We 
shall do this integration in the next section. 


Emission from an Atom. Part of the complication in this 
problem is simply the confusion produced by so many coordinates and 
states. So we shall begin by dealing with a simple problem just to 
get more used to what is involved. We shall solve the problem of the 
probability of emission of light by a single atom, using perturbation 
theory (assuming the interaction Sin: of light and matter is small and 
expanding it only to the first order). 

If Sin: is neglected, the radiation and matter are independent 
systems. Let the states of the atom alone have energy ey for various 
values of N with wave functions Yy(q), where q represents the q: of 
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all the particles of the atom. The state of the radiation can be defined 
by giving the values of all the integers nu, and na. The energy levels 
of the combined system are 


E = en + 2 (nix + nox)hke (9-45) 


The wave function for this state is a product 
Y = wn (q) (Nik Nox) (9-46) 


where &(11x,%2x) is the wave function for the radiation-field (a product 
of harmonic oscillator wave functions). 

To deal with atomic radiation of a photon, we consider as the 
initial state that the atom is in some level M and no photons are 
present (all nıx and nax equal 0). This wave function is 


Y; = Yu (qo (9-47) 


with o from Eq. (9-43). In the final state the atom is in another 
level N, but now a photon is present, say of momentum / and polariza- 
tion 1. According to Prob. 9-8 the wave function of the radiation 
alone is زو ته‎ the complete final wave function is 


Wee ee Vw (qa ho (9-48) 


Now to find the transition probability per second (to first order) we 
see, according to Eq. (6-79), we shall need the matrix element V,; of 
the perturbation potential between these states. The perturbation 
action is Sins, as defined in Eq. (9-32), and the corresponding potential is 


y = V4 D (aikJık 3 0:11 ix) (9-49) 
k 


where ji, depends on the atomic variables, as in Prob. 9-2. This 
matrix element is 


Vr = | VEY عل‎ an) VF (aiji + anit vache dg [ [ dan 
E k 


(9-50) 
which gives 


£ 2 ع‎ ji fane. |] dan l Viji m dq 


8 l : 
IR 2 T > Dp AA1xPo 1 daik / يرطي ذل‎ dq (9-51) 
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because only j depends on q. The expectation values of the a’s for the 
vacuum state were worked out in Prob. 9-7, that is, 


fi P aua Po II 00 11 - 0 
k 


unless k = J, in which case it is h/2lc. Let us write the matrix element 


Svxivau as (j)ru. Our matrix element is therefore V Int /le (ju) m- 
The probability of transition per second is then [ef. Eq. (6-94)] 


6 (=) lJualwar26(E m PI Eyn = hle) (9-52) 


Ordinarily we are not interested in the problem of exciting one 
particular photon but would rather see the probability of emission of 
any photon (of the polarization 1) into some small solid angle d0. We 
must sum 1 over all values which correspond to this direction. The 
number of values of 1 per unit volume is d*l/(2r)’, or if 1 is in the 
specified direction, we require the integral over dl of 1? dl dQ/(2m)%, so 
that we find that the probability of a transition per second is 


dP Pd ain dl dQ 

ae / oe lJulwa?6(Eu — Ey — - 2 Qn) (9-53) 
The integral on / gives 

dP : 

UE T مووي‎ Irla’ 9 (9-54) 


for the rate of emission of light of polarization 1 in direction 1 into the 
solid angle d2. The frequency emitted satisfied 


Ey — Evn 


7 (9-55) 


w = lc = 


Problem 9-9 For a complicated system moving nonrelativistically 


(ju)vu = 2 (ese + هون‎ #9) way (9-56) 
where e is a unit vector in the direction of the polarization of the light 
and e and q» are the charge and position of the bth particle. Assume 
the wavelength of the light is very large compared with the size of the 
atom, i.e., that the absolute square of the wave function describing 
the position of the bth electron falls to 0 over a distance small compared 
with 1/k. Show that we can then approximate exp (zk ٠ q,/h) by unity 
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and write the matrix element as 


JIK,NM = Wes UNM (9-57) 
where 
uvm = 2, (eva) var l (9-58) 


The function uym is called the matriz element of the electric dipole 
moment of the atom, and the approximation used to derive Eq. (9-57) 
is called the dipole approximation. Show that the total probability 
to emit light in any direction per unit time is 


dP 4o’ 

= luxul? (9-59) 
[Integrate Eq. (9-54) over all directions, remembering that e is perpen- 
dicular to k and that there are two possible directions of polarization.] 


Elimination of Electromagnetic Field Variables. Since the 
radiation field is represented by a quadratic action functional, we can 
integrate out all its coordinates. We shall do so here. We must 
integrate all the variables aix, asx in Eq. (9-44). We must specify 
the initial and final states of the radiation field. First we shall take 
the simplest case that initially and finally we have a vacuum, the 
oscillators all going from 0 to 0 photon number. Our amplitude can 


be written 

Amplitude = fe SnaX[g] Dg (9-60) 
where 

X[q = 1 el Sint Sead) |] dare dazi (9-61) 


k 


is a functional of the q’s which appear on the right-hand side of the 
equation in the current variables j. Since the action is a sum of 


contributions > (Six + Sex) from each mode, where 
7 


2م2 
s= | | viz (ja* + j*a) + gata — “Fata — =| 1 aes:‏ 


clearly X is a product of corresponding factors. The integral for one 
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typical mode, 
Xir = 1 {exp E f (VE itan + Vir jar, FE Yo عن يز‎ 
2م2‎ 
ate eee! ف‎ a| n 


exp |- FA sy 2001500 5 xy e` tkelt—s| dt ds | (9-63) 


is a type of path integral which we have already done many times, 
except for the complication of complex variables, which can first 0 
reduced to real variables. In fact, this is exnetly the problem dis- 
cussed in Sec. 8-9. The interaction no y(t) of Eq. (8-136) is here 
given by y = Vår jıx and w = ke. The final expression of Eq. (9-63) 
is equivalent to Eq. (8-138). The product of such factors for each k 
and polarization gives X = e//*, where 


I = D J ORO + JOIA s edt ds (9-64) 


Thus the problem of a vacuum-to-vacuum transition is completely 
solved in terms of a path integral on the matter variables alone : 


Amplitude = f e/GmctD )و2‎ (9-65) 


We shall discuss a number of consequences of this result. (The case 
that the initial or final state is not a vacuum is described in the next 
section.) 

It appears that the net result is PERN this: The matter acts not 
with Smat but with a modified action Shar = Sma + I. The modifica- 
tion results from a reaction with the electromagnetic field. This is 
not true in a strictly classical sense, for the action 7 is a complex num- 
ber. It can be shown that the classical physics which results from 
using the principle of least action, with the real part of Shar only, is 
exactly equivalent to the combination of Maxwell’s equations and 
Newton’s laws. But it does not correspond to the case that Maxwell’s 
equations are solved by using just retarded waves. (In fact, a restric- 
tion to retarded waves cannot be represented by any principle of least 
action in which only matter coordinates appear. Instead it cor- 
responds to using half the advanced and half the retarded solution.) 
Our full quantum-mechanical complex expression for J is correct, 
and we shall now look at its consequences. 


1J. A. Wheeler and R. P. Feynman, Rev. Mod. Phys., vol. 17, p. 157, 1945. 
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First-order Perturbation Expansion. The integral on q is too 
complicated to do exactly, but in the expression for the currents in 1 
the charge e of the particles occurs. Thus J is proportional to e?, 
which in dimensionless form is the fine-structure constant 


Crh 1 

hc 137.09 

a small number whose exact value and meaning are unknown except 
experimentally. Thus we may expect that the effect of I is small. 
We already know that the Schrédinger theory gives atomic levels, for 
example, quite accurately. There can be only small errors arising 
from the neglect of J. Let us look at the effect of J in first order in رثع‎ 
corresponding to second order in e on the original action of Eq. (9-32). 
Let us take the transition amplitude Amm as defined in Sec. 6-5, where 
the matter system begins and ends in state M. If 1 is neglected, the 
zero order is 


Naru? = e (iDEut (9-66) 


The first-order term is 
1 ; 
daca! = 5 f VDE Tula) D40 
1 i ٠. . 
E D J Vile Sne | PAORO) 
k 


+ 26) [(8)م‎ 5 eetl dt ds Yula) DaO) (9-67) 


Now we take t > sand double the result. The evaluation of a similar 
expression was worked out in Sec. 5-1. For the present example, we 
get 


Aum) = — z (AE)Te—EuTIà 
where 


AE = 2 2, = (Ji) un (Jinu + (J?) ew (Jon) a] 


x ie e(ilh) (Eu—Ey—hike)r dr = 2 1 [kya 


dîk 


+ [Gadu ]2ke(Eu — Ey — khe + te) a; 


(9-68) 
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This has a real and imaginary part and can be written as 


AE = 3p - لد‎ 


The real part ôE represents a small shift in the energy levels of the 


atom, called the Lamb shift. Such a shift was discovered experi- 
mentally by Lamb and Rutherford. This is 


= f ]|)2 “ادل‎ 


: 477 dk 
+ |(jox)warl?]P.P.(Eu — En — hike) Ske Oni (9-69) 
and the imaginary part is 
yl 2 j [| (Jax) “ايد‎ 
: Arh dk 
+ عو )جل اعد (ءو2)|‎ — Ey — hike ¢) arc Gn (9-70) 


The amplitude that the atom remains in the upper state with no pho- 
tons emitted goes as exp [—7(H» + ôE — iy/2)T/R] and the probability 
as e 7T". That is, the probability to remain in the state M decreases 
exponentially with the rate decay y. Physically it should decrease 
because the atom in the state M can emit a photon and fall to a lower 
state N. Comparison with Eq. (9-53) shows that y in Eq. (9-70) is 
indeed the total rate of transition from state M to all lower states N. 


A SINGLE ELECTRON IN A RADIATIVE FIELD 


The Energy Correction. In order to study the electromagnetic 
energy correction 8H, we shall consider the simplest case: that in which 
the matter system has only one moving charge (e.g., a hydrogen atom 
with an infinitely heavy nucleus or a free electron in empty space) 
whose coordinates we shall call R. Thus j = eR exp (k-R/h). We 
have here a case where j contains R, and in considering second- 
order terms we must take appropriate care, as discussed in Sec. 7-3. 
There is an additional term to ôE from the TT velocity term R?. 
Expressing R in terms of the momentum operator p, as in Sec. 7-5, 
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we obtain 
dîk 15 Aner 
ôE = » kcr)? (pie ia + peia) m?(Eu — Ev — tke) 


4re? 7# dk 
al SRO E 
Problem 9-10 Why do we not need to be careful to write 
Ie[pı exp (~ik ٠ R/h) + exp (—ik ٠ R/R)pı] in the matrix elements? 


Let us take the simplest case of a free electron at rest. Any SER we 
get for energy in the field will represent a correction to the rest energy, 
or as can be shown from the relativity theory, to the mass, ôm = 6Ep/c’. 
This is the so-called electromagnetic mass correction. For a free par- 
ticle at rest, the states are plane waves. If the momenta in M and N 
are pm and py, the matrix element [pı exp (—7k- R/h)]wm is zero 
unless py = pm — K, in which case it is .سيم‎ Thus for an electron at 
rest initially, the matrix element is 0 and 6H is just the last integral of 
Eq. (9-71), which is infinite! 


Short-wavelength Difficulties. This is not the whole of it. 
When we eliminated the term prp:/2k? in Se, we pointed out that this 


represented the interaction between point charges D e:e;|q; — qil, 
21 

but neglected to point out that the infinite terms 2 = 7 must also be 
included in the sum. Indeed, for a single particle p, = e exp (ik ° q), 
so 14|px|2/k? = 4re?/2k? and the term is 4re?f (15k?) d3k/(2r)3. The 
infinities here and in 8ER above do not cancel, and we are left with a 
real difficulty; our integrals over momentum k diverge quadratically. 
Quantum electrodynamics gives nonsensical results. 

It is true that we are using a nonrelativistic treatment of the charged 
particle. The relativistic treatment of the matter (quantum elec- 
trodynamics is not altered) does not rid us of the divergent results, 
although the order of infinity may be changed. For a particle of 
spin 0, like a ع‎ meson, the order is unchanged; it is still a quadratic 
divergence. Here there is presumably an experimental value of the 
mass correction available. As far as is known, through other inter- 
actions, the sole difference of charged and neutral ع‎ mesons is the 
charge, i.e., the different way they couple to the electromagnetic 
field. So presumably the mass difference of the charged r meson with 
a mass Mr of 273.2 electron masses and the neutral r mesons of 264.2 
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electron masses, that is, 9.0 electron masses = 0.034m,, or 4.6 Mev, 
represents energy in the electromagnetic field. 

If we arbitrarily stop our integrals at some higher momentum kmax 
(which is not a relativistically invariant procedure), we get an energy 
e?(Kmax)?/2rm,c from the last term of Eq. (9-71), which is the largest 
term if ñkmax/c is very much larger than the pion mass m,. If this 
equals 0.034m,c?, then (e?/2mlic) (ikmax/mrc)? = 0.034, or 


k 35 5.4m,c 5 0.8Mc 


where M is the mass of a proton. (The relativistic theory gives 
AE = 0.034m, with a cutoff at about the same energy.) It is for this 
reason that we conclude that our present-day formulation of quantum 
electrodynamics (or of the “particles” with which photons interact) is 
faulty. The fault lies in the way we deal with energies beyond proton 
mass or with corresponding frequencies, or wave numbers. The 
difficulty arises with modes whose wavelength is less than about 
4r X 10-14 cm. 

For the electron of spin 14 the Dirac theory shows that the electron 
should have a certain magnetic moment. It turns out that with such 
a magnetic moment the negative magnetic energy almost perfectly 
cancels the positive electric energy. The difference still diverges, 
although only logarithmically. -If a cutoff is applied to integrals over 
wavelengths, at the wavelength limit suggested above, the correction 
to the electron mass is only about 3 per cent, but there is no way to 
test this, for we do not recognize a neutral counterpart to the electron. 

For the proton the anomalous magnetic moment is so high that the 
magnetic energy exceeds the electric energy and the correction can be 
negative. The neutron is a magnet, too, so its correction is also 
negative. Since the proton moment is higher, the fact that the neu- 
tron is heavier than the proton might be explained. If the integrals 
are cut off at an energy of the order of a proton mass, the difference 
comes out correctly, but this is too crude a way to calculate such an 
accurately known energy as the 782.71 + 0.40 kev} equivalent to this 
mass difference. These mass differences (of proton and neutron, of 
charged and neutral pion, of positive, neutral, and negative sigma 
mesons, and of charged and neutral K mesons, etc.) present a serious 
challenge to modern physics and possibly point to the failure of quan- 
tum electrodynamics to give us a complete theory for calculating 
electromagnetic effects. We do not know whether it is truly quantum 


+ Page 354 of Everling et al., Nucl. Phys., vol. 15, p. 342, 1960, 
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electrodynamics or our assumptions about the distribution of charge 
inside the particles which are at fault. Only when we have a more 
complete theory of these particles and their interactions will we be able 
to determine the limitations, if any, of our present theory of quantum 
electrodynamics. 


THE LAMB SHIFT 


According to the Schrédinger equation, the second level of the hydro- 
gen atom is degenerate. The 2s and 2p levels occur at the same 
energy. Likewise, for the Dirac equation there is a degenerate pair 
2s,4 and 2py. But Lamb and Rutherford found in 1946 that there is 
indeed a small separation (about 1 part in 3 X 10°) with the 2s, 
lying higher by a frequency of 1,057.1 megacycles. 

Although theorists reasoned that such an energy difference might 
arise from effects of the term J, the infinities of the divergent integrals 
confused all attempts to calculate the difference until the work of Bethe 
and Weisskopf in 1947. They reasoned as follows: 

First, since 


1 1 Ey — Ey 1 


Eu — Ey — tke ike Ey — Ey — fike REC (9-72) 
the energy (9-71) was expressed as the sum of three terms, 
6H = 6H’ + ôE” a ôE” (9-73) 


where 


, _ 2re f Ck (Eu — En) (pie = iu + pe Fly) 
ers m?c? Gaye 2 Ey — Ey — ħkc Mail? 


A 2e? f dk 9 : 
ie eae one ) (pie ve + [poe im) (9-75) 
Ws wm 2reh dk 
aT (27) *k (9-76) 


The term ôE” and the infinity from the Coulomb term ôE. are 
independent of the state of the electron. They would (it was hoped) 
be made finite in some future theory. It would contribute some 6m to 
the rest mass of an electron. If mo is the mechanical mass, the true 
experimental mass m = mo + ôm, where émc? = ôE” + ôE.. In 
the total energy (including the rest energy of the particles and the 
binding energy) of the hydrogen atom, such a rest-energy correction to 
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the energy is, of course, expected, but we have already included it 
when we measure all binding energies relative to the free-particle 
ionized state. The 6m term is thus identified, because it is the only 
term for an electron at rest, and it is independent of the motion or state 
of the electron. 

The term ôE” could be simplified; for the sum on N could be taken 
to give (pı? + ,ررر(ثوم‎ (by the law of matrix multiplication). When 
k is integrated over all directions, this becomes 24 (p ٠ P)uu, and 


Shi a = (p 7 P) wu Sre? dêk 


2m 3me? J (2r)3k? (9-77) 


Again it was hoped that some day this term would be finite. It 
exists even for a free electron. It is interpreted as follows: The 
mechanical kinetic energy p*/2m would be altered (if the mass is 
altered) to the expression 


BPD (1 É z) (9-78) 


and the term 6#’” must represent — (p?/2mo) 5m. But we have 
already taken this term into account; for we calculate the Schrödinger 
energy levels with p?/2m, where m is the experimental mass. The 
term is identified because it is the only extra term for a moving free 
electron and it is proportional to the kinetic energy.! Finally, even 
though these terms may be interpreted wrongly, when we calculate the 
difference of the values of ôE for the 2s and 2p states, the terms will 
drop out, because 6H’” and ôE, are the same for all states and ôE” is 
also the same, since (p?/2m) ym turns out to be the same for the two 
states 2s and 2p. 

In the final term ôF’ the argument was made that the dipole approxi- 
mation would suffice. Then the matrix elements are independent of 
k, and since 


dk 1 Ar 0 
ل ا اه‎ Be co 
we get 
8H’ = w elsi 1 In لف وليك‎ (Eu — Exn)%|Pxu|? (9-80) 
rm?hc? £ Eu — Ey 


1The ôm implied by Eq. (9-77) is ( 8re?/3c?) fd*k/k? and is not equal to the 6m 
obtained from the 6H#/c? for a static electron. This is because we are limited to a 
nonrelativistic approximation. When a fully relativistic analysis is carried out, 
the two ways of calculating ôm agree. 
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Since the states and the matrix elements are known for hydrogen, 
the sum can be worked out. The only question is the value of hkmaxc. 
Bethe argued that the nonrelativistic approximation is at fault here 
and that, if the full relativistic calculation were made, fikmaxc would 
turn out to be of the order mc. Putting fikmaxc = mc? gave about 
1,000 megacycles, so Bethe knew-he was on the right track. 

The remaining problem was to make a relativistic calculation with 
the Dirac wave function and states. Only in this way could a precise 
determination of the effective kmax be made. This turned out to be 
quite confusing, for it was hard to identify the various infinite terms. 
It would not do to simply cut them off at some maximum momentum 
and take the difference; for it is not necessarily a relativistically 
invariant procedure, because it deals with momentum and energy in 
different ways. (One consequence of this has already been pointed out 
in the footnote.) One method for resolving the confusion was devel- 
oped by Schwinger, who showed how the relativistic symmetry could 
be kept clear throughout the calculation and the infinite terms identi- 
fied. Another method worked out by Feynman was to give a rela- 
tivistically invariant procedure to cut off the infinite integrals. Here 
we shall illustrate the latter method. 

The total effect of the electromagnetic field, which this time includes 
the Coulomb interaction, is represented by an extra term J + S, in 
the action. The relativistic invariance of an expression for J like Eq. 
(9-64) will not be self-evident, since that formula is expressed in terms 
of k and t instead of either R and t or k and w. Let us represent J in 
terms of frequency w and wave number k variables. First note that 


2ike‏ ا 
—ikeir|o—iwr = S‏ 
E SERE tec)‏ 
or‏ 
dw/2r ay‏ 211:6 1 
eT ese (9-82)‏ 


Suppose we define 
j(kw) = Six(ther* dt 
= SJj(R,e-**-#9 dR dt (9-83) 


Then J becomes 


T= - اا | مو‎ lt d'k do 


w?— We? + ie (27° GAS. 
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The relativistic symmetry of this expression in w and k is already 
clear; for w? — kc is invariant to the Lorentz transformation. The 
currents, however, do not appear in a relativistically symmetrical 


manner. We would have expected an invariant combination like 
cp” — j‘ j, since pc and j form a four-vector. But if we define 


p(K,w) = Spx(ietit dt 
= Jfp(R, te) dR dt (9-85) 


the Coulomb portion of the action is 


2 k A Kos 
es ]م‎ hee ee oe te (9-86) 


the last resulting simply by multiplying numerator and denominator by 
w/k? — c?. But the law of conservation of current 


OR تع‎ 

b - y.j (9-87) 
becomes 
wplk,w) = k۰ j(k,w) (9-88) 


Alternatively, if we call j the component of j in the direction of k, 
wp/k = و[‎ and we have altogether 


l - 5 
1 + مو = رو‎ f “لسكلا‎ + Lior ct 1)) - eo) 
dîk dw 


Gree Se 


x 

The sum of the three j terms is just j(k,w) - j(k,w), and the four- 
dimensional invariance is evident. 

A suggestion is now made that in view of our present ignorance, 

convergence of the integrals can be made artificially by supplying an 

additional factor such as 


—A2 2 


in the integrand of Eq. (9-89), where A is some very high frequency. 
For small values of w and 7: this factor is unity, whereas for high values 
it cuts off the integral. Furthermore, such a factor clearly does not 
destroy the relativistic invariance of the expression. All physical 
quantities are to be calculated by assuming J + S, contains this cutoff 
factor. If they are insensitive to A for large A (like the Lamb shift), 
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the theoretical value is to be trusted. If, on the other hand, the result 
depends sensitively on A (as does the charged and neutral pion mass 
difference) no quantitative meaning can be given to the result, for the 
cutoff function is arbitrary and is not completely satisfactory. This 
is the present state of quantum electrodynamics. 


Problem 9-11 Show that the cutoff function is not completely 
satisfactory by arguing that y calculated in the manner of Sec. 9-4 
would be altered by the cutoff, yet the probability of emission of a 
real photon would not be so altered because for such a photon w = kc 
and the modifying cutoff factor is exactly 1. Thus the balance of 
probability would not result (i.e., the probability that the atom emits 
plus the probability that it does not emit would no longer add to 
unity). The difficulty suggested by this problem has never been 
solved. No modification of quantum electrodynamics at high fre- 
quencies is known which simultaneously makes all results finite, main- 
tains relativistic invariance, and keeps the sum of probabilities over all 
alternatives equal to unity. 


Problem 9-12 Transform J + عكر‎ into space coordinates by using 


| فر س‎ dîk 24)/س0‎ _ i 
Or) o? — We? + i) (Pe? — 72 i) Cr) 


1 
= 12 ô, (tc? = R?) (9-90) 


[Note: The function —ri/(x — ie) is often written as 8,(z), and we 
have introduced that convention here.] Then find 


I + 8, = z; f EORR) — Rati) Rato) 


X ê,[(tı — t2)2c? — [Ri — Ro] dR, d'Re dt; dta (9-91) 


THE EMISSION OF LIGHT 


In Sec. 9-4 we found an expression for the amplitude that the matter 
system would do something when interacting with an electromagnetic 
field, as shown in Eq. (9-90) and the following development. This 
derivation was restricted to the special case that the field is both 
initially and finally in the vacuum state with no photons present. The 
result was that the action Sma: in the path integrals must be replaced 
by an effective action Shat = Smat + I. In a more general case, 
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photons were present, both initially and finally. As an illustration 
suppose that initially no photons are present but in the final state 
there is just one photon of momentum AL and polarization 1. The 
only change which this makes in our previous calculation is the change 
in the integral defining S, that is, Eq. (9-61). We shall now use 


XE! = 2 f elI) (Smart Sraa) Day Dax (9-92) 


where the path integral is carried out between a vacuum initial state 
and final state consisting of a vacuum plus 1 photon. Then every 
oscillator, except 1L, goes from the initial state n = 0 to the final state 
n = 0, so the factor Xx for all of these oscillators is unchanged. Only 
the contribution from the single-oscillator 1L is altered; for it now 
becomes 


es j 6 li J | Wie (Jai + juai) + Ghd — 1020207 تدك‎ 
| al) Darr (9-93) 


This expression is the same as Eq. (9-63) except that the oscillator path 
is taken between the state n = 0 and the state n = 1 instead of n = 0 
to n = 0 as in the previous expression. We worked out the behavior 
of a forced harmonic oscillator in Sec. 8-9, and we can use the results 
of that section to write 


2 a 
i= ( ~ | ine it) Xi (9-94) 


where Xi; is the n = 0 ton = 0 factor previously calculated. There- 
fore, evidently the complete factor X’ is simply the original factor X 
multiplied by 


27 1. piLe 
Te Í Jue dt 
and we find for the amplitude 
Amplitude = 1 i gil (Smart Z) | ine at Dg (9-95) 


The perturbation theory expression which we previously evaluated 
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was equivalent to the transition element 


277 i t 1 iLc - 
Nios / elil) Sma f ine t dt Dq (9-96) 


so we see that the net result is the same as that given by the perturba- 
tion theory with the following èxception. The transition amplitude 
must be calculated with the complete effective action Smat = Smat + I 
instead of just Smat The effect of J is to change the energy levels a bit 
as we discussed, but also to make the energy values effectively complex. 
The result is that the emitted light gives a spectral line which appears 
to have a little width, which is called the natural width of the line. 
We shall not go any further into the details of this calculation but 
leave the subject and the generalizations to a number of photons both 
entering and leaving the system to those who wish to study quantum 
electrodynamics specifically in more detail. 


SUMMARY 


Review of the Approach. In this chapter we have a considerable 
amount of analysis of the quantum electromagnetic field. It is worth- 
while looking back again to see the central ideas and results. The 
separation of the Coulomb interaction and the use of running waves 
are technical ways of accomplishing our ends, but the essential result 
is the formula of Eq. (9-89) [or its equivalent, Eq. (9-91)]. Let us 
review this result from the more general point of view exemplified 
by the ideas of Eq. (9-1). 

Suppose we have a system which can be described by an action 


S = Sı(q) + S2(q,A,¢) + Sa(A,¢) (9-97) 


where S,(q) is the action of the matter alone, و‎ is the interaction of 
matter and field, and S; is the action of the field alone, and where q 
stands for all the coordinates of the matter, with A, ¢ describing the 
field. Then the amplitude for any event results from evaluating an 
integral like 


K = f exp (FISO + Sr(aA,4) + SAAI) Dg DADs (9-99) 


subject to the boundary conditions of the problem in question. In this 
summary we shall assume that conditions for the field are that initially 
and finally no photons are present (i.e., ground state to ground state 
for the field), and we abbreviate this set of conditions as gnd-gnd, 
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Later on we shall consider the consequences of integrating q first and 
A, ¢last. What we have done so far corresponds instead to integrating 
A, ¢ first and reserving the q integral as a subsequent step. 


Usually S2(q,A,¢) is linear in the field variables A, ¢ and can be 
written as 


S2 = flo(R,)o(R,d) — jR,t) ° A(R,¢)] dR dt (9-99) 


where p and j are the electric charge and current density, which depend 
on the q only. The integral on A, ¢ is then easily performed because 
it is a gaussian integral. It is the burden of Eq. (9-91) to tell us the 
value of this integral, namely, 


md 1 
E 6 8 | S:(A,¢) + ji (ep —j- A) dR 1|) DA Do - 7 


)9-100( 
where J, which we called J + S. in Eq. (9-91), is 


J = tf[c%p(Ri,t:)p(Re,ts) — j(Ri,ta) ° j(Ro,te)]64[(tr — te) 2c? 
— [Ri — R,|?] dR, و0358‎ dt; dta (9-101) 


for any functions رم‎ j of R,t. The expression for Eq. (9-101) in momen- 
tum space appears in Eq. (9-89). 

In the applications of Eq. (9-98) these p, j are some function of 0 
and q, so we obtain the result that 


K(gntgnd) = f (exp لكا‎ + s@i}) ود‎ (9-102) 


where the functional of q, J(q) is given by Eq. (9-101) with the 
correct p, j substituted. This summarizes the results for gnd-gnd 
transitions. We express the modifying effect of the field on the action 
of the particles by the addition of J(q) to Si(q). The central formula 
for electrodynamics then is the general result of Eqs. (9-100) and 
(9-101). 


General Formulation of Quantum Electrodynamics. It is 
also of interest to pursue these matters in a different direction, by 
integrating over the matter coordinates first, and leave the field 
variables for later. We shall limit ourselves to a brief general descrip- 
tion of what results from this procedure. If in Eq. (9-98) we contem- 
plate integrating q first, the factor e“/”%s ig a constant and can be 
left out. We can therefore write Eq. (9-98) this way: If we define 


TIA,6] = f (exp [fS + 8.044.691 |) a (9-103) 
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then l 
K = | {exp E 44,99 | T[A,¢] DA Do (9-104) 


This K gives us the amplitude that the particle goes through a certain 
motion and the field undergoes a certain transition. Like all other 
amplitudes, it is the sum over all possible alternatives. Each separate 
alternative is constructed as the amplitude 7[A,¢] for the motion in a 
particular field A, ¢ times the amplitude e**:/* that the field is A, ¢. 
In carrying out the sum, we sum over all possible fields A, .ف‎ 

This law, given by Eq. (9-104), is the general fundamental rule for 
all of quantum electrodynamics. It is a correct formulation even when 
the functional T[A, ¢], the amplitude for the motion of the particles in 
an external potential A, ¢, cannot be represented as a path integral. 
For example, for a relativistic particle with spin (described by the Dirac 
equation), the quantity 7[A,¢] cannot be described by a simple path 
integral based on any reasonable action. However, it is possible to 
calculate T[A,¢] by other means, for example, from the Dirac equation. 
After the form of this functional has been derived, the amplitude K can 
be worked out, in principle, from Eq. (9-104). 

In stating the law of quantum electrodynamics in the form of 
Eq. (9-104), we have isolated the behavior of the electromagnetic field 
from the behavior of the particle (or system of particles) on which it 
acts. That such an isolation can be carried out is an important result. 
For example, the functional T[A,¢] may represent the behavior of a 
nucleus whose properties are not completely known. However, if we 
know only the behavior of the nucleus in an external field, then we can 
solve quantum-electrodynamic problems involving nuclei. 

Of course, to use Eq. (9-104) strictly, 7 must be known as a func- 
tional of A, ¢ for all A, ¢, but this much information is rarely avail- 
able. Even if it is available, the path integral over A, ¢ may not be 
easy. But in practice the formula is very useful. Sometimes T 
can be approximated by an exponential, linear in A, ¢, of exactly the 
form of Eq. (9-99). Then the result is obtained directly from the 
general formula of Eqs. (9-100) and (9-101). More often, T can be 
represented by a sum, or integral, over such exponential forms with 
various p, j and the result of Eq. (9-104) is a corresponding sum or 
integral over expressions containing e“/! with J in Eq. (9-101) involv- 
ing the corresponding p, j. 

In most practical situations T can be expressed as a power series 
in A, ¢. The first few terms can be found from the theory of the 
matter considering A, ¢ as a small perturbation. Subsequent sub- 
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stitution into Eq. (9-104) and integration on A, ¢ gives a corresponding 
perturbation expansion (in powers of e?/fic) for K. The necessary path 
integrals such as 


f A:(R tı) Aj(Rayts) {exp li S.4, |} DA Do 
= 2hd,[(t: — te)%e? — [Ri — Ral 


can be discovered by expanding the general formula of Eqs. (9-100) and 
(9-101) on both sides in powers of p, j and comparing corresponding 
terms. We shall not go further into these matters here, but refer 
the reader to the literature (e.g., sec. 8 of R. P. Feynman, Mathe- 
matical Formulation of the Quantum Theory of Electromagnetic 
Interaction, Phys. Rev., vol. 80, p.. 440, 1950). 
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Statistical Mechanics 


In preceding chapters we have discussed transitions in which a system 
goes from one known state to another. In most physically realistic 
situations the initial state is not completely determined. The system 
may be in one or another state with different probabilities associated 
with each. In this case the final state is equally uncertain, being that 
set of states resulting from the various possible initial states with the 
corresponding probabilities. Or-we may not be interested in the prob- 
ability to go to just one special final state, but rather the chance to 
end up in any one of a set of such states. 

An especially interesting case of statistical uncertainty of states is 
that corresponding to thermal equilibrium at some temperature T. 
A quantum-mechanical system in thermal equilibrium can exist in one 
or another energy level. The results of quantum-statistical mechanics 
show that the probability that a system is in a state of energy E is 
proportional to e~#/*7, where kT measures the temperature in natural 
energy units. (The conversion factor k, known as Boltzmann’s con- 
stant, is 1.38047 X 10-165 erg/°C, or 1 ev per 11,606°K.) 

In this book we shall neither derive nor discuss this exponential 
distribution law. We emphasize that the energy E is the energy of the 
entire system. If an energy level is degenerate, then each state at that 
particular level has equal probability. This means that the total 
probability for the system to have the particular energy value is 
enhanced by a factor corresponding to the number of states in the 
degenerate level. 

The exponential law given above is not yet a true probability dis- 
tribution, since it has not been normalized. The normalizing factor 
can be written 1/Z, so that the probability that a system should be in 
the state of energy F; (assumed nondegenerate for the time being) is 


. = 1 —E;ß 
Pii 7E (10-1) 


where 8 = 1/kT. This means 


Z= > ene (10-2) 
An equivalent normalization consists in defining an energy F such 

that 

p: = e- 8E: F) (10-3) 


F is called the Helmholtz free energy. Its value is, of course, dependent 
on the temperature T, although the various energy levels E; do not 
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depend on T. It is evident that 
Z = eF (10-4) 


THE PARTITION FUNCTION 


The physical properties of a system in thermal equilibrium can be 
derived from the exponential distribution function. Suppose A is 
the measure of some property and that its mean value in the ith 
state is 


A; = fos AQ: dV (10-5) 


where the integral is taken over the volume V of the system. Then 
the statistical average for A for the whole system is 


A=) يميم‎ = 3) Ac (10-6) 


For example, the average or expected value of the energy itself is 


poe > E ZY Eez 


= ` 77 فقسو‎ E:F) (10-7) 


If the normalizing factor Z is known as a function of the temperature, 
the sum of Eq. (10-7) can be easily evaluated. From Eq. (10-2) we 
have 


X Beth = — 28 7 (10-8) 


This means that 


LISTS Sry Ae ee) os 
E cary لك‎ oT Ri ToT 
8 
ب‎ 10-9 
ag CP) (10-9) 


We have written the derivatives with respect to the temperature as 
partial derivatives because other variables, such as the volume of the 
system or any external fields, which determine the energy levels are 
all held fixed. 


10-1 


Quantum mechanics and path integrals 


It is interesting to see what happens to the expected value of the 
energy if some other variable such as the volume is changed. Suppose 
the system is in a particular state E; and we make a small change in the 
value of a certain parameter, say, a. Using a first-order perturbation 
principle, we find that the first-order change in energy is equal to the 
expected value of the first-order change in the hamiltonian. That is, 


E; + AE; = fọ} (H + AH)¢: dV 
= fpf AH :و‎ 07 (10-10) 
Using the language of classical physics, we would say that the 
ratio AH /Aa is the “force” associated with the parameter a. In case 


this parameter is the volume, the force is minus the pressure. That is, 
we define the concept of force by the relation 


Force X change in parameter = change in energy 


or 
aH 
fa = an (10-11) 
As an example, then, if P = pressure and V = volume, 
— PAV = AE (10-12) 
We write the expected value of the force as 
oH oH 
al اد‎ 
3 O Al OE: a P E, a A ee 
Deas Z da ° z AOG Vas Z da 
(10-13) 
so that 
117 
ja = — r (10-14) 


where 8 and other parameters are held constant. Using Eq. (10-4), 
we can write this as 


oF 
fom ae (10-15) 
If the parameter a is the volume V so that —f, is the pressure P, we 
have 
oF 
Pinte 5 (10-16) 
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When the volume is changed by an infinitesimal amount for a sys- 
tem at a constant temperature, two things happen simultaneously. 
First, each of the energy levels shifts slightly. Second, for the system 
to stay in equilibrium at a constant temperature T (maintained by a 
bath, for example) the probabilities associated with each energy level 
will change slightly (because the energy of the levels changes). If the 
only effect were a change in the energy of the levels, then the change in 
the total energy of the system would correspond to this change aver- 
aged over all the levels. From our foregoing discussion this cor- 
responds to pressure times the change in volume. However, to keep 
the temperature fixed, some readjustment of the population of states 
must occur. Thus the total energy must make an additional change 
which we shall call 00. This additional energy is given up to the 
external system (the bath) whichmaintains the temperature, and it is 
called the heat exchanged. Thus 


dU = —PdV + dQ (10-17) 


We can find dQ easily from the expression for U given in Eq. (10-7). 
When V is altered by the change dV, then each energy level E; under- 
goes the change dE; and the Helmholtz free energy changes by an 
amount dF. Thus the total energy changes by the amount 


dU = 2 dE; eE) + BdF ZE; tE — BIE; dE; eP E P (10-18) 


The first term in this expression is the expected value of dE;, which is 
—P dV, as we have already explained. The remaining two terms con- 
stitute dQ. These two terms can also be expressed with the deriva- 
tives of the sum in Eq. (10-2), and ultimately in terms of F. In 
fact, we find 

°F 


= T -rs 0-19 
dQ Toray 47 (1 ) 


That this is true can be seen also from Eq. (10-17), which gives 


0 oF OF 
2 AG r) av 
oF 
2 10-20 
orev (29:20) 


Equation (10-19) gives the heat exchanged dQ in changing the vol- 
ume by the amount dV while keeping the temperature constant. If 
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we change any other parameter, we shall arrive at an analogous result. 
For example, if we change the temperature T while holding the volume 
V constant, the heat exchanged is equal to the change in total energy. 


That is, 
EET, oF <7 ar Pe 
In general, then, we have the result 
°F °F : 
AQ = -r (ey 7 aT av AY + = aT da Aa E EVIE mAT) (10-22) 
The right-hand side of Eq. (10-22) is of the form T times the total 
change in a quantity S = — (F/T), which is called the entropy. 
That is, 
405 )10-23( 
oF 
S=— aT (10-24) 
U =F — TS (10-25) 


It is evident that the standard thermodynamic quantities, internal 
energy, entropy, pressure, etc., can all be evaluated if a single function, 
the partition function Z, is known in terms of the temperature, volume, 
external field, ete. The thermodynamic quantities are obtained sim- 
ply by differentiating Z, or equivalently, the free energy F. 

The determination of some physical quantities, even for a system 
in thermal equilibrium, requires more information than only the parti- 
tion function. For example, suppose the system is in a configuration 
space with a coordinate x and we ask: What is the probability of 
finding the system at location x? We know that if the system is in 
the single state defined by the wave function ¢;(z), the probability of 
observing x is the absolute square of the wave function ¢;‘(x)¢;(2). 
Thus, averaging over all possible states, the probability of observing 
x is 


P(r) = 3 J F(x) di(a)e*™ (10-26) 


In the general case, if we are interested in any quantity A, then the 
expected value is given by 


BE 22 Asen = 22 [ot @) Aoa): dz (10-27) 
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It is evident that the expected values of all such quantities could be 
obtained if we knew the function 


p(x'x) = X, dia") 6 (weno (10-28) 


This suffices since the function A appearing in the integral of Eq. 
(10-27) is an operator which operates only on the ¢; of that expression, 
and not on ¢;. Using the quantity p(x’,x), we can imagine A to act 
on x’ only, after which we set x’ equal to نه‎ in the form Ap(z'x), and 
finally integrate over all values of x. This process is called finding the 
trace of Ap. 

From the definition of p(x’,x) it is clear that 


Pane T ١ )10-29( 


and since the probability P(x) is normalized, so that the integral over 
all x gives 1, we have 


Z = Jp(x,x) dx = Tr [p] (10-30) 


where Tr is an abbreviation for the word “trace.” The quantity 
p(x’,x) is called the density matrix [more precisely, the statistical den- 
sity matrix for temperature T; the term “density matrix” also has a 
wider use for general systems in or out of thermal equilibrium and is 
usually used for the normalized version of the function p(x’,x), that is, 
the function we would write as p(x’,x)/Z]. The general problem of 
statistical mechanics is to evaluate Eq. (10-28) to find the density 
matrix. If we are interested only in conventional thermodynamic 
variables, we need only the trace or diagonal sum of the density matrix, 
which gives us the partition function Z. 


THE PATH INTEGRAL EVALUATION 


The formulation of the density matrix given in Eq. (10-28) bears a 
close resemblance to the general expression for the kernel, which was 
derived in Chap. 4 and given in Eq. (4-59) as 


K (@ajto;rajtr) = Y, ġ;(22) Q} (te WP FG (10-31) 
7 
The validity of this expression is restricted to situations in which the 


hamiltonian is constant in time and tz: > t However, this situation is 
implied in statistical mechanics; for only if the hamiltonian is constant 
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in time can equilibrium be achieved. The difference between the 
form of Eq. (10-31) and that of Eq. (10-28) is in the argument of the 
exponential. If the time difference t — tı of Eq. (10-31) is replaced 
by — ibh, we see that the expression for the density matrix is formally 
identical to the expression for the kernel corresponding to an imaginary 
negative time interval. : 

We can develop the similarity between these two expressions from 
another point of view. Suppose we write the density matrix in a way 
which makes it look a little bit more like a kernel, thus, k(£2,U2;£1,U1) 
for p(£2,x1), where 


K(x,usjrijur) = X, Qi(t2) p? (are Wain Be (10-32) 


Then if ينه ,’2 = ونه‎ = 2, uz = RB, and u; = 0, Eq. (10-32) becomes 
identical with Eq. (10-28). 
If we differentiate k partially with respect to uz, we get 


-h SE = Y Beales) ot (ese toons دمل‎ 


But now we recall that E:¢;(z’) = H¢,(x’); so if we understand و2‎ to 
imply operations only upon the variables روه‎ we can write 


oir roa) (10-34)‏ ل 
dus‏ 
or, to put the same thing another way,‏ 
eC) = HOD (10-35)‏ = 


We notice that this differential equation for p is similar to the 
Schrédinger equation for the kernel K which was developed in Chap. 4 
and given in Eq. (4-25). We can rewrite it here as 


_ 4 aK(2,1) 


7 = HK (2,1) for to > ty (10-36) 
0 و01‎ 


We found in Chap. 4 that the kernel K(2,1) is Green’s function for 
Eq. (10-86). In the same sense the density matrix p(2,1) is Green’s 
function for Eq. (10-35). 

With simple hamiltonians involving only momenta and coordinates 
we have been able to write the kernel as a path integral. For example, 
if the hamiltonian is given by 


h? d? 
H = — 5 54+ VO) (10-37) 
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then the solution for the kernel over a very short time interval 
te as ty - 6 


is 


im (z2 — zı)? i 
K(2,1) = 4 وا‎ exp E SRN =e eV (2t#)| (10-38) 


which can be directly verified by substitution into Eq. (10-37). By 
building up a product of many kernels of the form of Eq. (10-38) and 
taking the limit as the time interval e goes to 0 and the number of 
terms in the product becomes infinite, we have produced a path integral 
describing the kernel over a finite period of time. We can produce a 
solution to Eq. (10-34) in the same manner. The solution for an 
infinitesimal interval of uz — ui = 7 is given by substituting e = — in 
into Eq. (10-38). Thus 


exp CAS aV es + 20/2)‏ ل لكا ا 


(10-39) 
That this is a valid solution of Eq. (10-34) can be demonstrated by 
direct substitution of Eq. (10-39) into Eq. (10-34). 
The rule for the combination of functions defined for successive 
values of u is the same as the rule for the combination of kernels for 
successive intervals of time. That is, 


k(2,1) = fk(2,3)k(3,1) drs (10-40) 


That this result still holds follows from the fact that Eq. (10-33) is a 
first-order derivative in u. This rule can be used to obtain the path 
integral to define k(2,1) as 


N—1 


K(%2,U2;21,U1) = 1 (exp 8 D E ipa كرو‎ 1 vé ||) 
N-1 
x ne (10-41) 
t=1 0 
The normalizing constant a now becomes 
a = „|2 (10-42) 
m 


and the integral is carried out over all paths going from zı to za (that is, 
zi is zı for 2 = 0 and zz for i = N) in the interval us — uı = Nn. 
The result of this derivation is that if we consider a “path” z(u) 
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as a function which gives a coordinate in terms of the parameter u, and 
if we call 2 the derivative dx/du, then 


p(t2,21) = f 6 8 ر‎ ip 8 i?(u) + ve) | au} ) Da(u) (10-43) 


This is a very amusing result, because it gives the complete statis- 
tical behavior of a quantum-mechanical system as a path integral 
without the appearance of the ubiquitous 7 so characteristic of quantum 
mechanics. (Incidentally, this is not so for a system moving in a 
magnetic field.) This path integral of Eq. (10-43) is much easier to 
work with and visualize than the complex integrals which we have 
studied previously. Here, it is easy to see why some paths contribute 
very little to the integral; for there are paths for which the exponent 
is very large and thus the integrand is negligibly small. Furthermore, 
it is not necessary to think about whether or not nearby paths cancel 
each other’s contributions, since in the present case all contributions 
add together with some being large and others small. 

The parameter u is not the true time in any sense. It is just a 
parameter in an expression for the density matrix p. However, if we 
wish to make use of analogy in our thinking, we can consider u as the 
time for a certain path, and in so doing we can state the result given by 
Eq. (10-43) in a vivid pictorial way. What we are doing is providing 
a physical analogue for the mathematical expression. We shall call u 
the “time,” leaving the quotation marks to remind us that it is not 
real time (although u does have the dimensions of time). Likewise 
šã will be called the “velocity,” mz?/2 the “kinetic energy,” ete. Then 
Eq. (10-42) says that the density matrix for a temperature 1/8 is 
given in the following way: 

Consider all the possible paths, or “motions,” by which the system 
can travel between the initial and final configurations in the “time” Bh. 
The density matrix p is a sum of contributions from each motion, the 
contribution from a particular motion being the “time” integral of the 
“energy” divided by 7 for the path in question. 

The partition function is derived by considering only those cases 
in which the final configuration is the same as the initial configuration, 
and we sum over all possible initial configurations. 


Problem 10-1 Show that the expression for the density matrix 
for a system consisting of a harmonic oscillator is 


Cate) wml ene ست‎ ee 
: 2rh sinh whh 2h (sinh w8h)? 


X [(z + x?) cosh wh — 2xx']} (10-44) 
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This answer can be compared with the results of Prob. 3-8. Show 
also that the free energy is kT In [2 sinh (hw/2kT)}. Check this latter 
value by a direct evaluation of the sum of Eq. (10-2). 


If the temperature is not too low (how low is too low will be dis- 
cussed later on in this chapter), 87 is very small. Thus, in calculat- 
ing the partition function for which x, = va, each path starts from zı 
and in a very short “time” is back to x; again. In fact, the paths can- 
not ever wander very far from 2, because traveling far away and 
returning again in the short “time” available requires a high “velocity” 
and a large “kinetic energy.” For such a path the exponential func- 
tion appearing in Eq. (10-43) becomes negligibly small, and it will 
contribute a negligible amount to the sum over all paths. Under 
these circumstances the paths z(u) which must be considered in 
evaluating V[x(u)] never move very far from the initial point 2ı. 
Thus to a first approximation we can write V[x(u)] ~ V[zı] for all 
paths. In this approximation the potential energy is independent of 
the path, and the exponential function dependent on the potential can 
be taken outside the integral. Thus for temperatures which are not 
too low we have 


= eve [7 Sf gd نهعم‎ z 
PELT = 6 a {exp | oF Io £?(u) du |} Da(u) (10-45) 

In this last expression the path integral is that for a free particle. 
It can be solved in the same way that we solved the path integral 
defining the kernel for the motion of a free particle in Chap. 3. The 
result is 


je {exp |- a 1 22) du |} Da(u) 
mkT | mkT (xz, — 
wht P| 212 


a (10-46) 


If we are interested only in the partition function, we set £s = zı and 


find 


p(r1,21) c= a eV (a1) (10-47) 


Then, the partition function is the integral of this expression over all 
possible initial configurations zı. Thus 


mkT 
2rh? 


This is a formula for the partition function valid in the limit of 


7 fi eV (= ريو‎ (10-48) 
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classical mechanics. It was originally derived, within an uncertain 
multiplying constant, as a consequence of classical mechanics by 
Boltzmann. In more complicated cases (e.g., more variables) the 
classical partition function is simply the product of two factors. The 
first of these is the path integral which one would get by considering all 
particles of the system to be free. The second factor is called the 
configuration integral, and it is just the integral of e, where V 
is the potential of the system and depends upon all of the N variables 
describing the system. For example, for N particles interacting by a 
potential V(x:,x2, . . . ,Xw), where x, is the vector position of particle 
a, the integral required is 


f{exp [—BV(x1,X2, . . . ,Xw)]} 0251 dx» - ٠ °` d*xy 


This simple form for the partition function is only an approximation 
valid if the particles of the system cannot wander very far from their 
initial positions in the “time” 6h. The limit on the distance which the 
particles can wander before the approximation breaks down can be 
estimated in Eq. (10-46). We see that if the final point differs from 
the initial point by as much as 


Ax (10-49) 


ool 

JV mkT 
the exponential function of Eq. (10-46) becomes very greatly reduced. 
From this, we can infer that intermediate points farther than Az away 
from the initial and final points can be reached only on paths which do 
not contribute greatly to the path integral of Eq. (10-43). If the 
potential V (x) does not alter very much as x moves over this distance, 
then the classical statistical mechanics is valid. 

For a typical solid or liquid at room temperatures, with an atomic 
mass of about 20, for example, Az is about 0.1 A, while the interatomic 
distances and forces range over one or two angstroms. Thus motions 
greater than 0.1 A will not contribute to the density matrix, while the 
potential function will remain unchanged until the motion of the order 
of one or two angstroms has been achieved. It is clear that classical 
statistical mechanics is adequate for such materials. 

All of the mysterious transformations between solids, liquids, and 
gases ordinarily lie in a range where classical statistical mechanics is 
valid. The mathematical interpretation of all these processes is con- 
tained in the problem of evaluating the integral of e 7# over the coordi- 
nates of all the atoms. That this amazing variety and peculiarity 
of phenomena comes from just a simple integral is at first surprising, 
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until it is realized that the integral is a multiple integral over a stu- 
pendous number of variables. Our usual experience with integrals 
which involve one or, at most, a few variables of integration does not 
prepare us for the almost qualitative differences that can arise when 
the number of variables approaches infinity. 

The fascination of the problems in the theory of the solid states, or 
of liquids and of the condensation of gases, lies, like the behavior of 
this multiple integral, in the way in which simple descriptions of simple 
systems when joined together in enormous multiplicity yield such rich 
phenomena. It is a challenge to the imagination to see how the 
cooperation between systems can lead to such results. A rough 
qualitative explanation is readily forthcoming for many of these effects, 
but the problem of quantitative detail also holds fascination for the 
theoretical physicist. 

There are important statistical phenomena which occur when the 
classical approximation is not valid. In this case the multiplicity of 
variables is compounded with the conceptual complexity of quantum 
mechanics to raise even greater challenges. 

Strictly speaking, Eq. (10-48) does contain a little more information 
than was available to the purely classical statistical mechanics. This 
is evidenced by the appearance of 7 in the coefficient in front of the 
integral. Classical mechanics could not determine the partition 
function absolutely, but only within an unidentified constant factor. 
Thus the logarithm of the partition function was determined only to 
within an additive constant. That meant that a term proportional to 
T appeared in the expression for the free energy, or an additive con- 
stant appeared in the entropy. This constant, which was sometimes 
called the chemical constant, could be completely evaluated only 
after the quantum-mechanical solution was worked out. 


QUANTUM - MECHANICAL EFFECTS 


We have mentioned that there are some cases in which the classical 
approach is not adequate. For these cases it is necessary to include 
changes in the potential function which result from the motion along 
the “path.” In this section we shall calculate the first-order effect 
of the potential when the motion of the particle is taken into account. 

Instead of approximating V(x) by the constant value V (zı) in the 
expression for the density matrix, Eq. (10-43), we might try a Taylor 
series expansion for V around the point zı. However, we would find 
that we could save effort and increase our accuracy if we chose to 
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expand about the mean position given by 

1 spa 
T = سدم‎ 0 
aa k z(u) du (10-50) 


which is defined for any particular path. We can characterize each 
path by its mean position and carry out integrations over all such 
positions instead of integrations over initial positions 21, as was done in 
Eq. (10-48). In this way the partition function becomes 


Z = fdz [2 (e (- z {5 [atau + fm Viz(u)] au} ) | a 


In this expression the paths are chosen to satisfy two conditions: (1) 
that 2 given by Eq. (10-50) is fixed and (2) that the initial and final 
points are identical (which implies that the integral over all paths also 
includes an integration over all end points 21). 

Using a Taylor series expansion for V(x) about the point ž, we find 


ie è Vie (u)] du = BAV (2) + A ^^ ielu) — ZIV’ (2) du 
ve 5 1 ^ el(u) — 72V” )2( du (10-52) 


By virtue of Eq. (10-50) the second term on the right-hand side of this 
last equation is zero. Thus, by expanding about the mean position, 
we arrive at an expression for which the first nonzero correction term is 
of second order. Using this expansion and including no terms of 
higher order than the second, we have for the partition function 


= j e 8® dz ae | exp (- je 5 2 
TIF a av"@)| (| Delu) (10-53) 


The path integral in this expression differs from those of our previous 
experience in one particular way. The paths over which the integral 
is to be evaluated are constrained by Eq. (10-50), which can be 
rewritten for present purposes as simply 


JE @- 2) du =O 


The substitution y = z — 2 as the path coordinate then gives the 
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constraint in the form 

Ba 
R y du = 0 
and the path integral itself is 

i { [Am 1 5 du 
ee (exp | jk E ý? + 5 y? V o] +) Dy(u) (10-54) 

The integrand of this path integral is the same as that for a harmonic 
oscillator, with the frequency given by w = —V’’(0)/m. 


We now apply the constraint to this path integral in the following 
way. We multiply the whole path integral by the delta function 


ph 5 : noA 
ô ( 1 0 du). In order to manipulate the ô function within the path 
integral, we express it by its Fourier transform, 


èla) = fT, lexp (ike) Ê 


and write 


_ a ipa {exp | 5 i g G vert 5 Vy? + ity) au |} Dy(u) 
(10-55) 


In this form, the path integral contains the constraint of Eq. (10-50), 
and we can therefore proceed directly with standard path integral 
techniques to obtain the desired solution. We note that the integrand 
of the path integral now has the same form as the path integral for 
a forced harmonic oscillator if we interpret both m and V” to be 
imaginary. However, we are interested only in the case V” small, 
and the approximation of including only the first-order term V” can 
be made at any convenient stage. 


Problem 10-2 Use the methods of Chap. 3 and, in particular, 
Eq. (3-66) to solve this path integral. Remember that paths of 
interest in this problem have the same initial and final points and that 
completion of the path integral requires an integration over all values 
of this point. Finally, carry out the integration over all values of k 
to get as a solution 


E Bh? 
Const [i Fim 


The partition function which results from the solution obtained in 


yV% @) | to first order in V” (10-56) 
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Prob. 10-2 is written best in the form (valid to first order in V”) 


7 i (cxp م‎ [væ +E vr |}) 0 )10-57( 


Here the unknown constant has been evaluated simply by comparison 
with the classical result of Eq. (10-48). We see from this result that 
the partition function has the same form which we derive under clas- 
sical assumptions. The only difference is that a corrective term has 
been added to the potential. This corrective term, (Bh?/24m) V” (2), 
is clearly quantum-mechanical in nature, as can be seen from the inclu- 
sion of Planck’s constant ñ}. 


Problem 10-3 Show that for many particles (which we identify 
by subscripts so that the mass of the ith particle is m;) moving in three 
dimensions, the correction to the potential is 
Bre Nat 


(10-58) لحري 


In practice, the results of this calculation are not very useful. In 
most problems, e.g., in a gas of colliding molecules, the potential rises 
very sharply so that there is a violent repulsion at small distances. 
In such a case the second derivative is very large. When this is not 
so, the formula may be of some use. It has one advantage in that it 
may be easily extended to another order of accuracy. 


Problem 10-4 Show that the correction to the partition function 
up to the order of 74 contains the factor 


222 0 7B4h4 is 
5 E x Tam een 
Bh? WN 0-3 
Rm OT | 


We have seen above that quantum-mechanical effects might be 
represented by calculating the partition function as in the classical 
formula of Eq. (10-48), but instead of using the correct potential V(x), 

: : h? : 
we use a modified potential V + f V”. This suggests that we try 
to go further and seek some possibly better effective potential U (ë) 
which, when substituted for the true V in the classical equation (10-48), 
would represent a good approximation to the correct quantum- 
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mechanical partition function. We start out with the exact expression 


2 1 {exp ]-877)2([( dz f exp (3 f adu 


- 1 f View] - Via} du) De(u) (10-59) 


We can look upon the path integral in this expression as representing 
the average over the paths z(u) of the function ef with 


f= - f view) - via Z (10-60) 


where the averaging process is carried out with the weighting function 
exp [—(m/2h) fa? du]. If we were-to replace this average of an expo- 
nential with the exponential of an average, thus 


(ef) ج‎ e (10-61) 


we know we would make an error of the second order in f or, better, 
of the order of the difference between (f)? and (f2). We shall see in 
Chap. 11 that we can determine the sign of this error, i.e., the left- 
hand side is greater than the right. 

We shall obtain an average value of f for each value of 2, that is, 


ie [|ex(- a 8 i? du) | Ja {V[x(t)] — VIZI} dt Dz (u) 
(10-62) 


where we constrain the path to return to its starting point, and to 
satisfy the integral restraint given by the definition of Z in Eq. (10-50). 
To evaluate this path integral, we define a somewhat different, but 
related, path integral 


1a) = ff | exw (- ah v au) (VIE + y(O] - Vil} Dy(u) dY 
(10-63) 
where the paths y(u) are subject to the restriction 


vO = y(t) = Y f y(u) du = 0 (10-64) 


Now, it happens that 7)2( is independent of t. We can see this 
with the help of the following argument. Suppose each path in the 
integral is not of finite length, but is really a Bh-length segment of a 
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Fig. 10-1 All paths which return at t = 87 to their initial value (at t = 0) can be con- 
sidered as 8A-length segments of periodic paths where the period is Bh. 


periodic path whose period is also 867, as shown in Fig. 10-1. Con- 
sider two of the families of all such paths, one y(t) and the other 
y(tı + t) = ,)رن‎ as shown in Fig. 10-2. The value which the first 
attains at t = tı, namely, y(t1), is reached by the second when its argu- 
ment is 0, that is, yı(0) = y(t:). Furthermore, for any other point 


| 
| 
| 
| 


Fig. 10-2 Suppose one of the “‘periodic’’ paths y(t), as shown in Fig. 10-1, has the value 

y(tı) at t = tı. Then the collection of all “periodic” paths must contain this same path 

slipped left a distance tı, that is, y(t + tı), which will have this same value até = 0. The 

result of a path integral average over all such paths must then be independent of the 
selection of the initial point on the ¢ axis. 
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t; there exists in the family the analogous function y,(é) for which 
y:(0) = y(t), and all such paths give the same contribution to 


ihe y? du 


Of course, all these statements apply to each path included in the 
path integral. Thus we see that we lose nothing by arbitrarily setting 
| = 0 in the path integral over all paths y(u), which is the same as 
saying that the integral is independent of t. 


Problem 10-5 Using the method outlined for Prob. 10-2, show 
that (f) and I(3) are related by 


1(@) = [gaa | VE + Y) = Være ay = 2 0-65) 


Suppose we call our approximation to the partition function Z’ and 
call the Helmholtz free energy to which it belongs F’, so Z’ = قحم‎ 
Using the results of Prob. 10-5 and Eq. (10-61) we have 


= ji (exp {— BIV (2) + I(2)]}) dz f | exp )- 7 p i? au) | Dz(u) 


The path integral appearing in Eq. (10-66) is the integral evaluated 
in Eq. (10-46). Thus, we can write 


mkT 
2rh? 


epr f e bU) dz (10-67) 


where 


12m 


Sd 10-68 
2rbh? ( ) 


0)2( = | VE + yeme dy 
and the term V (z) has canceled out. 
These results mean that we can calculate an approximate free 
energy F” in a classical manner [i.e., with an expression like Eq. ) 10-48)] 
and get a good approximate result if we use an effective potential 
0 )2(, as defined by Eq. (10-68), in place of V (z). Incidentally, we 
note that the effective potential is temperature-dependent. i 
The effective potential is a mean value of V(x) averaged over points 
near 2 in a gaussian fashion, where the mean-square spread (or stand- 
ard deviation) of the ganssian weighting function is )872/ 12m)”. 
Furthermore, if we follow through the various inequalities which are 


involved in our approximation, we find that the approximate free 
energy F’ exceeds the true free energy F. The details of this are dis- 
cussed in the next chapter, Eq. (11-9) and following. 


Problem 10-6 Show that the relation of Eq. (10-68) becomes the 
“corrected” potential of Eq. (10-57) (that is, the argument of the 
exponent in that equation) if V is expanded as a Taylor series. 


Problem 10-7 Test the validity of the approximation as it applies 
to the harmonic oscillator, for which the exact value of the free energy 
is i 


: hw 
Peas = EL In (2 sinh ar) (10-69) 


Evaluate the approximate value for the free energy by means of the 
approximate potential U. Show that 


_ me? Bh 1 
and that 
5 hw (hw)? 
nee SL (in w) + ART )10-71( 


Determine the free energy or, better, the ratio of the free energy to 
kT, for various values of frequency. It is suggested that the values 
of 1, 2, and 4, for example, be used for the ratio iw/kT. Show that F’ 
is greater than F as expected and that the errors grow as the tempera- 
ture falls. Note that if we are even very far from the classical region 
(e.g., when the ratio iw/kT = 2, so that the system has an 85 per cent 
probability of being in the ground state) the approximate results are 
still surprisingly close to the true results. 

Compare these results with those obtained with the classical approxi- 
mation in which the free energy is given by kT ln (iw/kT). These 
results should show the values of 2F/hw as given in the accompanying 
table. 


0.08266 
0.08333 
0.00000 
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10-4 SYSTEMS OF SEVERAL VARIABLES 


If a system has several variables, the formulas describing them are 
obtained by direct extension of the methods we have already studied, 


except for some special problems which arise from consideration of 
symmetry properties. 


Liquid Helium. As an example consider the problem of finding 
the partition function for liquid helium. Suppose we have N identical 
atoms, each of mass m, confined in some volume. Suppose further 
that atoms interact in pairs through a potential V(ri2). This poten- 
tial is a weak attraction at large distances and a very strong repulsion 
at short distances. Just to orient our thinking, we might imagine V(r) 
as the potential for hard spheres. That is, 


V(r) =0 >0 
VQ) — S SSG a = 2.7 À )10-72( 


The lagrangian for such a system has the form 
= 1 l2 1 E È 
te 23 n|Ri| 22 V(r.) (10-73) 


which means that the partition function is 


Z= | Ro / È (- (Fi le IR()|? dt 


d; D he VIR) — R;(0] (| DYR(t) (10-74) 


In this last expression the symbol d¥R stands for d?R; dR,» d'R; ٠ - 
dRy. Similarly, DYR means DR, DR: DR; ٠ ٠ ° DRy. We assume 
that all the path integrals are taken between initial points R,(0) and 
final points R;(8) such that R;(0) = R,(@). 

The form which we have written down in Eq. (10-74) is actually not 
correct. The symmetry properties which we mentioned above will 
affect this result. This characteristic is one of the interesting features 
of quantum mechanics of identical particles. In Chap. 1 we mentioned 
that if an event occurs in two indistinguishable ways, then the ampli- 
tudes for the two ways will add. In particular, when we are dealing 
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with indistinguishable particles, one alternative way for accomplishing 
any event always exists; namely, the interchange of two particles. In 
such a case the amplitudes for the particles (1) as interchanged and 
(2) as not interchanged must be added. (This addition rule applies to 
Bose particles. For Fermi particles the contributions for amplitudes 
which arise from odd permutations of particles will subtract from each 
other.) Ordinary helium atoms are of isotopic mass 4 and contain 
6 particles: 2 protons, 2 neutrons, and 2 electrons. This means that 
helium atoms are Bose particles and the amplitudes for interchange 
of particles add. (For instance, we say that Bose particles follow 
symmetrical statistics, whereas Fermi particles follow antisymmetrical 
statistics.) 

To see how this addition of amplitudes comes about, at least for 
helium atoms, we can follow this line of argument: In the final state 
the atoms cannot be distinguished from each other. Thus, although 
the appearance of the configuration of atoms may be the same finally 
as it was initially, the identity of some of the atoms may have been 
exchanged. 

For example, an atom which we shall designate as 1 starts a position 
zı(0). We have assumed that some atom at least will be in this same 
position at the close. Thus, for some atom 2z(8) is equal to 21(0). 
However, it may not be atom 1 which ends up in this particular place. 
Instead, atom 1 may go to the initial position of atom 2, say zx2(0), 
while at the same time atom 2 has moved into the initial position of 
atom 1. That is, it is possible that atoms 1 and 2 exchange places in 
the final configuration. 

To describe this situation in the most general terms, let Pz; stand 
for some permutation among the atoms which are initially at x: 
Thus, for example, in the situation in which atoms 1 and 2 were 
exchanged and all others remained where they were, we would have 


Pti = Pr: = 21 Pr; = 23 
Pty = ty ۰. ۰ ° _ (10-75) 


In general, the final state can be any permutation of the initial state: 
a;(8) = P2,(0) (10-76) 
Thus in order to construct the complete amplitude, we must sum ampli- 
tudes over all the N! possible permutations, since each permutation 


represents an alternative possibility. The normalization is correct 
if we average over all the permutations. The resulting rules for 
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statistics mean that Eq. (10-74) must be replaced by‏ ته 
ea dy i PR:(0) TR. Bh o‏ 
if mid PRO fer, [ex p( TON! piai‏ 
VIR«(t) — R,(t)] (| DYR(t) (10-77)‏ * 1 > + 
F‏ 


where ye means a sum over all permutations P. 


P 

If we were dealing with Fermi particles, e.g., the isotope of helium 
which has three nucleons, we would have to include an extra factor of 
+1, positive for even permutations and negative for odd permutations. 
There would also be some extra features which depend upon the spin 
of the atom in our result. 

It is possible to give a more detailed derivation of Eq. (10-77) in 
the following manner. For helium-4 atoms the quantum-mechanical 
amplitude for two atoms which start at positions a and 5 to get to 
positions c and d is 


K(c,a;d,b) + K(d,a;c,b) (10-78) 


(Amplitudes for alternative final conditions add, since these con- 
ditions cannot be distinguished from each other.) In this expression 
K(c,a;d,b) is the complex amplitude to go from a to c while the other 
particle is going from 5 to d. 

Since the particles are indistinguishable, their symmetry properties 
imply that the amplitude to find the two particles eventually at the 
points c and d must be a symmetric function of cand d. That is, the 
wave function ¥(c,d) must be a symmetric function of the variables 
Vig ta SL DAL is, 


v(c,d) = ¥(d,c) (10-79) 


If the particles were Fermi, the wave function would have to be an 
antisymmetric function of these positions. 
If many particles are involved, the rule is simply extended, that is, 


WAT 2.8 eae IV) Pe VC Soyo se N) 

= ¥(1,2,4,... ,N) 

= etc. (10-80) 
The simplest statement of the general rule is that the wave function 
must be symmetric (antisymmetric for Fermi particles). Although 


other solutions of Schrédinger’s wave equation exist, only symmetric 
and antisymmetric ones appear in nature, Hence in the sum defining 
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= the partition function in Eq. (10-2), we do not wish the sum over all 


energy eigenvalues of the hamiltonian H which can be obtained from 
solution of HQ, = Enn, but only over those for which the wave 
function ¢, is asymmetric function. For example, the density matrix 
p(x’,z) is defined by Eq. (10-28) with a disregard for the statistics 
of the N atoms involved. How can we reduce this sum to include only 
symmetric wave functions? 

To accomplish this reduction, we use the following trick. First we 
notice that from any function a symmetric function can be produced 
simply by permuting all variables and adding together the resulting 
functions. Thus, for any function f(x1,72) the combination f(zı,z2) + 
f(x2,21) is a symmetric function. It follows that for any wave func- 
tion $(11,%2, . . . ,tv) the function 


4)) = إلا‎ ¢(Pz:) (10-81) 
P 


is symmetrical: Now if ¢,(a:) is a solution of the Schrödinger equa- 
tion, then ¢/,(z,) as defined by Eq. (10-81) is also a solution, since the 
hamiltonian H is symmetric for an interchange of coordinates. There- 
fore, each interchanged form ¢,(P2) is a solution, as is the sum. 
Some of the energy eigenvalues E, have eigenfunctions ¢, which are 
symmetric, and some do not. Suppose Ez is an energy eigenvalue for 
which the Schrédinger equation does not have a symmetric solution. 
Then the sum D (Px) must vanish, since if it existed it would be a 
P 


symmetric solution for Er. This result implies that the operation 
defined by Eq. (10-81) selects just those solutions to the wave equations 
which are symmetric. All other solutions vanish. If ¢,(x) is sym- 
metric, then it is equal to ¢,(Px); and since there are N! ways of 
permuting the N atom, we have 


2 (ربندط) ,ف‎ = N!on(ai) if ¢, is symmetric 
P 


= 0 if ¢, is of any other symmetry (10-82) 


These results give us an answer to our question. We can now select 
out of the sum defining the density matrix those particular elements 
which apply to symmetric states. Thus 


all 
Y, e(Pa’ x) = X Y bn(Px’) ين‎ (ejet 
P Rf BP 
sym 


= N! J ¢n(t)Qn(z)e-3 


n 


= N an EE) (10-83) 
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This is the reason why in Eq. (10-77) defining the partition function 


for symmetric statistics we permute all the particles and divide by N!. 
The resulting partition function corresponds to 


sym 


f Paw 202003250 = Zam = > e PEs (10-84) 


n 


We note some of the features of Eq. (10-77). At high temperatures, 
we should expect a classical solution for the partition function with no 
quantum-mechanical effects in evidence. Suppose we disregard the 
effects of the potential for the moment and consider the effect of the 
motion of an atom from its initial point to some other point a distance 
0 away. In the path integral of Eq. (10-77), this is a motion from the 
initial point R;(0) to the permuted position PR,(0), and the contribu- 
tion of that particular permutation to the sum over all permutations 
is proportional to exp (—md?kT/2h?), thus decreasing with increasing 
temperature or increasing spacing between the atoms. Hence, unless 
the atoms are extremely close together, no permutation in the sum is 
important—even the simplest interchange between two atoms—in 
comparison with the identity permutation which leaves all atoms in 
their original location. If we include the effects of the potential 
which increases steeply at a radius of 2.7 A from the center of an atom 
in liquid helium, then no configurations in which the atomic spacing is 
less than this value are important. 

Since only the identity permutation makes a significant contribution 
to the summations, all that remains for our consideration is the factor 
1/N!. In the early days of classical thermodynamics it was realized 
that such a factor was convenient when dealing with identical particles, 
but its significance was not clearly understood. Its effect on the 
chemical constant is called the entropy of mixing when systems of 
several different kinds of atoms are studied. 

As the temperature falls, the exponential factor exp (—md?kT/2h?) 
prejudicing against migrations to new final positions becomes smaller 
and smaller. This means that at extremely low temperatures new 
terms become important in the summation over permutations. Of 
course, the quantum modifications must be included; and we saw they 
could be included as a first approximation by replacing the potential V 
by a corrected potential U. As the temperature falls, the specific 
heat of liquid helium begins to rise slightly near about 2.3 or 2.4°K. 


Problem 10-8 The density of liquid helium is 0.17 g/cm’. Give 
an order-of-magnitude estimate of the temperature at which perturba- 
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tion terms should begin to play an important role in the description of 
liquid helium. 


At first sight, one would not expect very elaborate exchanges of 
atoms to ever be important. An exponential factor involving the 
spacing must be included each timè an atom moves to its neighboring 
location. If we call this factor y, then for r atoms to move to neigh- 
boring spots the factor y” must be included, and since y is certainly 
less than 1 at any temperature, y” could become quite small for large r. 
We certainly would think that as r approaches any reasonable fraction, 
of the approximately 1022 atoms in a cubic centimeter of liquid helium, 
contributions from factors like y” must be infinitesimal. However, 
this first sight does not take into account the fact that with r atoms 
permuting, there are an enormous number of possible permutations, r!. 
Thus the small weight of one particular permutation is offset by the 
large number involved. 

Another question which arises in the description of liquid helium 
concerns the type of permutations which are involved. Any permu- 
tation can be described by cycles; thus 1-4, 4-7, 7-6, 6-1 is a cycle. 
Are the important cycles long or short? A careful estimate shows 
that at moderate temperatures, only simple exchanges of two atoms 
are important. Then as the temperature falls, cycles of three atoms 
become important, then four, and so on. But then suddenly, at a 
certain critical temperature, cycles of much greater length L offset 
by their great number the small value of y7. At this temperature 
cycles of importance become very long, involving nearly all of the 
atoms inside a container. At this point the curve of specific heat vs. 
temperature shows a discontinuity. Below this temperature the 
behavior of the liquid is very strange. It flows through very thin 
tubes without any resistance for low velocities. It stimulates infinite 
heat conductivity in bulk, ete. These odd characteristics are mani- 
festations of quantum mechanics, particularly the constructive inter- 
ference between amplitudes for replacing one atom with another. 
Quantitatively, the details of the behavior of the specific heat just at 
the transition temperature are not on a very firm foundation. But the 
qualitative reason for the transition is clear.! 

The expression analogous to Eq. (10-77) for Fermi particles, such as 
He’, is also easily written down. However, in the case of liquid He’, 
the effect of the potential is very hard to evaluate quantitatively in an 
accurate manner. The reason for this is that the contribution of a 


1 A more detailed discussion of the partition function of liquid helium from this 
point of view may be found in R. P. Feynman, Phys. Rev., vol. 91, p. 1291, 1953. 
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cycle to the sum over permutations is either positive or negative 
depending on whether the cycle has an odd or even number of atoms 
in its length L. At low temperature, the contributions of cycles such 
as L = 51 and L = 52 are very nearly equal but opposite in sign, and 
therefore they very nearly cancel. It is necessary to compute the 
difference between such terms, and this requires very careful calcula- 
tion of each term separately. It is very difficult to sum an alternating 
series of large terms which are decreasing slowly in magnitude when a 
precise analytic formula for each term is not available. 

Progress could be made in this problem if it were possible to arrange 
the mathematics describing a Fermi system in a way that corresponds 
to a sum of positive terms. Some such schemes have been tried, but 
the resulting terms appear to be much too hard to evaluate even 
qualitatively. 

For molecules which are separated by distances in the neighborhood 
of 1 A we have seen that the effects of exchange (the nonidentical 
permutations) are important only when the temperature is down to a 
few degrees absolute. In contrast to this, consider the behavior of 
electrons in a solid metal. The mass of the electron is so much smaller 
than that of a molecule that the critical temperature is much higher. 
At room temperatures, electrons in a metal are described accurately 
only by equations which include the exchange effects of these cyclic 
permutations. From this point of view, room temperature is very 
cold for electrons. The exchange effects are of dominant importance, 
or, to put it another way, the electron gas is degenerate. Of course, 
the electrons interact by Coulomb’s law, which is quite strong. But 
since the effects of the Coulomb attraction are of long range, they 
tend to average out. To a fair approximation, the electrons act as if 
they were independent, although, of course, each moves in the same 
periodically varying potential produced by the arrangement of the 
nuclei and the average of the positions of neighboring electrons. From 
the study of the ideal Fermi gas neglecting interactions, we can learn 
a lot about the behavior of electrons in metals. 

However, it is apparent that we cannot learn quite enough, for 
the superconductivity of metals occurring below a few degrees absolute 
would remain a mystery. This phenomenon, in some metals at least, 
involves an interaction in which the slow vibratory motion of the atoms 
is involved. We conclude this because the transition temperatures 
for two different isotopes of the same metal depend on the atomic mass. 
This value of the isotopic mass would not be important if the transition 
were simply a matter of mutual interaction between electrons, or 
interaction of the electrons with an idealized array of fixed atoms. The 
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idealization that the atoms are fixed must be incorrect. But how does 
the motion of the atoms produce a sudden jump in specific heat in 
some metals and permit electrical conductivity below this temperature 
without resistance? This question was first answered in a convincing 
way by Bardeen, Cooper, and Schrieffer." The path integral approach 
played no part in their analysis, and in fact it has never proved useful 
for degenerate Fermi systems. 


The Planck Blackbody Radiation Law. The partition func- 
tion for any system of interacting oscillators is easily worked out. 
Such a system is equivalent to a set of independent oscillators of fre- 
quencies w; However, the value of free energy F for independent 
systems is the sum of the values of F for each of the separate systems, 
which we find directly from the sum of Eq. (10-2) to be 


hw 
kT In (2 sinh ar) 


This gives the free energy of a linear system as 
hwi 
F = kT In € sinh an) 


= kT 7 In (1 — ل (2*اسفحى‎ = 


(10-85) 


The last term in this expression is a ground-state energy of the system. 

For an electromagnetic field in a box of volume V, the modes are 
defined by the vector wave number K, two for each K. The zero- 
point energy is omitted. Thus the free energy of the electromagnetic 
field SERE per unit volume is 


F = kT ies oe si In (1 — e- Kel?) (10-86) 


The internal energy U is the partial derivative of BF with respect to 8 
which becomes (putting w = Ke) 


aK 1 


The volume element in K space can be written as 
2 
dK = 4rK? dK = 4r = dw (10-88) 


1 J. Bardeen, L. N. Cooper, and J. R. Schrieffer, Phys. Rev., vol. 106, p. 162, 1957, 
and vol. 108, p. 1175, 1957. 
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This means that the energy in the electromagnetic field in the range of 
frequencies from w to w + dw is 


2٠4ج‎ hw 
Orc) eont — 1 (10-89) 


This is the famous blackbody-radiation law discovered by Planck. It 
was the first real quantitative quantum-mechanical result discovered 
and was the first step in the discovery of the new laws. 

Another early quantum-mechanical triumph was the explanation of 
the temperature dependence of the specific heat of solids by Einstein 
and by Debye. This also comes from Eq. (10-85), but the oscillators 
are now the normal modes of the crystal, as described in Chap. 8. 
For example, the thermal energy per unit volume of such a crystal is, 
like Eq. (10-87) (leaving out the zero-point energy), just 


= hw(k) dk 
3p I J exp [ho(k)/kT] — 1 (2r)? (10-90) 


where w(k) is the frequency of a phonon of wave vector k. In a 
erystal, this is a multiple-valued function (there are 3pw values for 
each k if there are p atoms in a unit cell), and we must sum over each 
of the possible values of w for each k. The k integral extends only 
over the finite range proper for the crystal. For light there are two 
modes for each k, each of frequency w = kc, so the sum gives a factor 
2 and Eq. (10-87) results, the integral on k now going to infinity. 

The result of Eq. (10-90), studied in various approximations by 
Hinstein and Debye, gave a good accounting of the main features of 
the specific-heat curve, particularly the behavior at low temperatures, 
which had been in direct contradiction to the classical expectations. 
Today, putting a more complete knowledge of the phonon spectrum 
w(k) into Eq. (10-90) yields a completely satisfactory description of 
that part of the specific heat of solids due to internal vibration of the 
atoms. 


REMARKS ON METHODS OF DERIVATION 


The presentation of statistical mechanics given in the early part of 
this chapter leaves much to be desired. The fundamental law which 
shows that the probability for finding a system in the energy state E is 
proportional to e—#/*? is usually derived by considering the interaction 
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of complex systems over long periods of time. But an entertaining 
problem presents itself. 

We have started our discussion of physics in this book with a 
definition of the laws of quantum mechanics using a path integral 
technique, as shown in Chap. 2. Just as a question of curiosity, let 
us take the point of view that this is the fundamental law. Then 
ultimately these statistical properties of a system whose quantum- 
mechanical properties are defined by such a path integral are found to 
be expressible in terms of the partition function Z. This function 
can be defined by a path integral of an obviously very similar and 
closely related form, as shown by Eq. (10-77). Yet the derivation of 
this result requires noting the wave equation, the existence of station- 
ary states and eigenvalues, and the argument about interaction over 
long periods of time to which we referred, all of which leads to the 
expression (10-1) for the partition function in terms of the energy 
levels E;. Finally, we proceed to the reverse argument producing the 
path integral formulation for Z. Is there any way to derive the path 
integral expression for Z for a system in equilibrium directly from the 
path integral description for the time-dependent motion? Can we 
find a short cut which avoids the mention of energy levels altogether? 
If it is possible, we do not yet know how to do it. 

One might ask: Why try it at all? It is like showing that you can 
swim with your hands tied at your back. After all, you know there 
are energy levels. The only excuse for trying to avoid their mention 
would be that in so doing a deeper understanding of physical processes 
might result or possibly more powerful methods of statistical mechanics 
might be evolved. At any rate, it would be interesting to solve the 
problem. 

It was the promptings of a similar quest, to get the well-known 
variational principle permitting the computation of the lowest energy 
level of a system directly from the path integral formulation (instead 
of indirectly via the Schrédinger equation), which resulted in the 
methods described in Chap. 11. Thus the results of this apparently 
academic problem were of some use as well as of some interest. 

Nevertheless, if we prefer, we can suppose our desire for one par- 
ticular course in achieving a solution is prompted simply by an aca- 
demic interest in the methods of classical physics. Suppose that we 
have a system obeying the principle of least action, with the action 
defined by 


1 
S= 5 | madı + f Oz + a) at (10-91) 
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so that the equation of motion is 
AORE 
mt = 5 [x(t + a) + z(t — a)] (10-92) 


Here we have created the curious situation in which a particle is 
driven by a force depending on the average value of coordinates that 
were and that will be. There are exponentially exploding solutions of 
Eq. (10-92), but let us say that only motions for which z remains finite 
both in the distant past and in the distant future will be allowed. 
(Incidentally, it is likely that solutions which we wish to ignore are 
excluded anyway if the action law is stated as 6S = 0 for all variations 
of path dx subject to the restraint 5x »ك‎ 0 fort + œ.) 

For such a system it is possible to define an expression for energy 
which is conserved; for the equations of motion of the system do not 
depend on time. (No simple hamiltonian gives the equations of 
motion.) Presumably, such a system could possess properties which 
allow it, for example, to be perturbed by molecules of a gas and thus 
achieve thermal equilibrium. We might ask: What are the averages 
of various quantities describing a system obeying the equations of 
motion of Eq. (10-92) and appropriate boundary conditions at infinity 
when it is in equilibrium at the temperature T? Perhaps such a 
problem is not definable, or perhaps it is easily solved only in this 
special case because the equations of motion are linear. But the aim 
of these remarks is to ask whether the existence of a hamiltonian and 
momentum variables is indeed necessary to the formulation of classical 
statistical mechanics—or whether a wider class of mechanical systems 
can be analyzed, a system in which the equation of motion comes most 
simply from the principle of least action, even though that action 
involves more than the instantaneous positions and velocities of the 
particles in the system. 

This question is the classical analogue of our more interesting ques- 
tion, namely, how do we proceed directly from the path integral 
formulation of quantum-mechanical laws of a mechanical system to 
the path integral formulation of statistical mechanical laws for the 
same system in equilibrium? 


Problem 10-9 Show that the expression 
EW) = 5 mO 8 ETO 5 : [ew — ae) a’ (10-93) 


defines a conserved energy for the equation of motion (10-92). 


In general, for any action functional, like S, that does not involve 
the time explicitly (i.e., is invariant for the transformation t—> t + 
const) there is an expression H(T) for the energy at time T which is 
conserved. It can be found by asking for the first-order change in the 
action S when all paths are changed from x(t) to 2[t + n(d)], where 
n(t) = +¢/2 for t > T and n(t) = — ¢/2 for t > T, with constant e. 
ôS is then ef(T) for infinitesimal e. 


Problem 10-10 Discuss the problem of the path integral formu- 
lation of statistical mechanics for a particle which is in a time-constant 
magnetic field. y 
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The Variational Method 


In this chapter we discuss a method based on a variational principle 
for the approximate evaluation of certain path integrals. First, we 
shall illustrate the method by some examples. Later, we consider 
those problems for which the method may be useful. 


A MINIMUM PRINCIPLE 


Suppose we wish to evaluate the free energy ل‎ of a system. This 
problem can be expressed in terms of path integrals by starting with 
the partition function for the system defined in Eq. (10-4) as 


Z = ۴ (11-1) 


In Eq. (10-30) the partition function was expressed as an integral 
over the density matrix p(z,x). Then, in Sec. 10-2, a kernel expression 
for p(x,x) was developed. It allows us to write 


i= a 0 esl Da(u) day (11-2) 


so long as we use the “time” variable u as an imaginary quantity. 

In Sec. 10-3 we developed a perturbation technique for the evalua- 
tion of the path integral defining the partition function for certain 
special cases. We shall now describe another technique, applicable 
in those cases where S 25 real. For ordinary cases without a magnetic 
field (and no spin) S is real. 

Throughout the remainder of this chapter, we assume that we 
are choosing units in such a way that the value of i is 1. Whenever 
it is necessary to include 7 symbolically in order to visualize the 
quantum-mechanical character of a result, it can be so included by a 
straightforward dimensional inspection. 

Let us suppose that some other S’ can be found which satisfies 
two conditions: First, S’ is simple enough that expressions such as 
“كول‎ Da(t) or JGeS’ Dx(t), for simple functionals G, can be evaluated. 
Second, the important paths in the integral fe’ Dz(t) and those in the 
integral fes’ Dx(t) are similar, that is, S’ and S are similar when they 
are both small. Now suppose F” is the free energy associated with S’. 


That is, 

ea BP’ = ee Ne eS’ Da(u) 021 (11-3) 
so that 

SSeS Da(t) dx, _ EE 

JSE Da(t) dz, ° (11-4) 
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Then since eS = eS—8’eS’, we can write Eq. (11-4) as 
J fes-s'es' Da(t) des [f fe” Dali) dry! = e2- (11-5) 


This says simply that e-*-F” is the average value of e378’ where this 
average is taken over all paths with the same initial and final point 
and the weight of each path is eS’ Dz(t). All possible values of zı 
are included in the averaging process. 

One way to proceed now would be to suppose that S — S’ is small 
and that F — F’ is small and then expand both sides up to the first 
power in their respective exponents. This method appears dubious 
because (F — F’) is not small if 8 is large. However, comparison of 
higher-power terms shows that this is nevertheless a legitimate approxi- 
mation to F — F”. 

The argument can be made much more rigorous and powerful in the 
following way. The average value of e” when ند‎ is a random variable 
always exceeds or equals the exponential of the average value of 2, as 
long as x is real and the weights used in the averaging process are 
positive. That is, 


(e) = e# (11-6) 


where (x) = the weighted average of x. This follows because the curve 
of e? is concave upward, as shown in Fig. 11-1, so that if a number of 
masses (weights) lie along this curve, the center of gravity of these 
masses lies above the curve. The vertical height of this center of 
gravity is the average vertical position (e*) of the points. It 


Fig. 11-1 We assume the weighting factors a; are positive and look on them as different 
masses positioned along a string. Then the exponential of the weighted average of x, that 
is, e, must lie below the weighted average of the exponentials le”) because of the concave 
nature of the curve .2م‎ The value of ¢*) must lie in the curve, but (e*), the center of 
gravity of the several points, must lie above the curve. 


exceeds e“, the ordinate of the curve e” at the abscissa position of the 
center of gravity, which is the average value (x). 

On the left-hand side of Eq. (11-5) we take the average value of 
eS-S’ over paths with the positive weights eS’ Dx(t), where S’ and S 
are real. Hence, by Eq. (11-6), this exceeds the quantity e38’, 
where (S — S’) is the average of S — S’ with this same weighting 
scheme, namely, with the weight eS’ Dz(t). That is, 


(S — 8’) = ff(S — S'je Dz(t) dzı[f fes Dx(t) dz] (11-7) 
We have then 
e(S-S’) < e-A(F—-F’) (11-8) 


This result implies that 


Fo < Fi — 5 (8 — 8’) (11-9) 


Our final result is then 
F<F'—ô (11-10) 


where 
£ 3 if (S — Se Dx(t) dx | f f e5 Dx(t) dx | (11-11) 


It is very fortunate that we have a minimum principle here. It 
says that, if we calculate Fj — ô for various “actions” ,“كر‎ that calcu- 
lation which gives the smallest result is nearest to the true free energy 
F.t The energy F is actually obtained, of course, if S’ = S; but we 
can guess that if S and S’ differ in some sense to a first order of small- 
ness, then the derivation of F’ — 6 from F must be of second order. 

If only a reasonable general form of S’ can be guessed but certain 
parameters still remain uncertain, the calculation of F’ — ô can be 
made leaving these parameters undetermined. Then the nearest 
approximation to F will be the lowest F’ — 6 available. That is, 
the “best” values of the parameters are those which minimize F’ — ô, 
“best” in the sense that the resultant F’ — 6 differs least from the 
true F. 

This same minimum principle can be used to find an approximate 


f Again, it is worth emphasizing that neither S nor S’ is an action functional in 
the proper physical meaning of the term, since both are defined with the imaginary 
variable u used as the “time” variable. However, operations with path integrals 
are the same for these functionals as for proper physical actions used previously. 
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value for the lowest energy state of the system Eo. Recall that 

Z = GBF = > e-BEn (11-12) 
n=0 

As the temperature of the system becomes lower and lower, that 

is, as 8 grows larger, terms involving higher values of energy become 

less and less important in this series. Eventually the series for Z 

is dominated by the term of smallest energy, ef. That is, 

lim Z = «#قحع‎ (11-18) 

ه جم 

Now following the line of argument developed in the preceding para- 

graphs, we can simply replace F with Eo. We define Ej as the result 

of the path integral involving the new action S’ and finally derive 


Eo < Eo — ê )11-14( 


as an approximation in the limit of large 8. 

In approximating Eo by this technique, our task is somewhat simpler 
than it was for the free energy F. Specifically, we can disregard the 
specification that the initial and final points of the paths be the same. 
To understand this, we refer back to Eq. (10-28) and note that as 8 
becomes large the density matrix 8(z',z) is also dominated by the 
zero-order term and approaches e #*po(z’) %0 (x). Thus the depend- 
ence on x’ and z enters into a multiplying factor but does not affect 
the nature of the exponential behavior of the function. It is this 
exponential behavior which is fundamental in the evaluation of Eo by 
this technique. 


AN APPLICATION OF THE VARIATIONAL METHOD 
As an example of the evaluation of a partition function using this 
variational principle, consider the example of a single particle con- 


strained to move in one dimension. Using the approach developed in 
Chap. 10, we write the action for such a particle as 


S=— A (5 (t)? + ve) dt (11-15) 


So, for large values of 6, the partition function is 


enPEs بح‎ fj [exp (- 1 3 [r t? + vee} at)| Dx(t) dro (11-16) 
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This path integral is over paths which return to the initial starting 
points; and after the path integral has been evaluated, a further 
integration over all possible starting points is carried out. 

In Sec. 10-2 we considered this same problem and pointed out 
how the classical approximation may be derived by inspection. In 
the classical limit of high temperatures or high values of kT compared 
with 7, the value of 87 is so small that paths which get very far away 
from zo do not contribute. Thus, the potential can be replaced by the 
constant value V (zo), and the path integral contributes only a con- 
stant, giving 


es (classical) = = f e BV) dy (11-17) 


as shown in Eq. (10-48). 

In Chap. 10, one quantum-mechanical improvement was made on 
the classical result by expanding the potential about the average posi- 
tion of the path and using terms up through the second order in this 
expansion. Then a still greater improvement was achieved by using 
a potential U, developed in a particular averaging process. From the 
point of view of this chapter, we see that that approach was a special 
application of the variational method. To clarify this point, we shall 
review the key steps using the notation and concepts of this chapter. 

Thus we wish to derive a suitable trial function W(z), where Z is 
the average position of a path defined by 
3= x(t) dt (11-18) 

Along any particular path, this substitute for the potential is a 
constant, so that the new form of the action along that path becomes 


ee if Se dt — BW (2) (11-19) 


With this more general form, it is possible to calculate both F’ and 
(S — 8’). 

Proceeding along this course, we use Eq. (11-14). Substituting 
into this expression, we have 


- ff ff veena - wo) 


(11-20) 
JJ | مت‎ (= ff Barat) | {exp -wO D200 dz, 
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It is to be remembered that the paths to be used in the path integrals 
of Eq. (11-20) are those which have the same initial and final points, 
and, as in Eq. (11-16), a further integration over all end points zo is 
to be carried out. 

Note that the numerator of 6 is quite similar to the term J (3) 
introduced in Eq. (10-63), if we restrict ourselves to paths that have a 
specific average value 2 and count on integrating over all possible 
values of 2 at a later stage of the calculation. By the same arguments 
as were used in the discussion of 7)2(, we see that the numerator of 
ô is independent of 2. We can evaluate the path integrals in both 
numerator and denominator by the methods used in Chap. 10 and 
take the answer from Eq. (10-65), remembering that 


H > TT (11-21) 


Since the denominator is simply a special form of the expression appear- 
ing in the numerator, the result is 


JE [E ve - wan fex | - & e - |[نه‎ 
X (expl -BW )8([[ dzo dž 


EE ا‎ 


X {exp ]- 817 (2)]} dzo dž 


(11-22) 


The integral over zo in the denominator in Eq. (11-22) can be easily 
evaluated to give (8r/6m)”*. Furthermore, the integral over the 
term in the numerator containing the factor W (z) results in this same 
multiplying constant. It will be more convenient for our future work 
if we carry out that particular integration in the numerator and further 
simplify the resulting expression by defining the function V(#) as 


7@ = ee S| Væ {exp |- 0 (e, — a|} dzo (11-23) 


The form of V (z) reveals the quantum-mechanical effect we have 
introduced. This function is a weighted average of V(xo) with a 
gaussian weighting function just like the function U (zo) defined by 
Eq. (10-68), and the gaussian spread is again (@h?/12m)”. For a 
helium atom at a temperature of 2°K, this spread amounts to about 
0.7 Å. At room temperatures, however, it is only about 2 per cent 
of the 2.7-A diameter of the atom. The value of 5 can now be written 
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as 


s(W(@) — 7 )2([ وخ‎ [-6W )2([[ dz : 
J {exp [—BW (&)]} dz (11-24) 


The next step is to evaluate W (2) by the requirement that we obtain 
the minimum value for F’ — ô, as shown in Eq. (11-13). F’ is defined 
by 


exp (—BE%) = J e5 ندج‎ )0( 


=| {exp | - [i Fs dt — awe) || Da (t) 


= f {exp [-8W E) 


x a È (- ٤ oe at) Da(t) dz (11-25) 


The path integral is a simple one whose value we know to be 
4/m/2xr8, so that we obtain 


2 
The next step, finding the optimum choice for W (27), requires us to 
determine the effect of a small variation in the function W(z) on the 


value of F’ — ô and set this effect equal to 0. Thus, imagining W to 
be replaced by 


W جب‎ W(z) + n(@) (11-27) 
we find from Eq. (11-26) that the variation in F” is 


ô = 


exp (—BE4) = | مت‎ [-8W@)]} dž (11-26) 


, _ Jn(a) {exp [- 86W (2)]} dz 
oo = Slexp ]- 17م‎ J] 42 aaa” 


and the variation in ô from Eq. ae 1-24) is 


Ta = BWI} de 
f {exp [—sW(@}[W(@) — VE] 2 
X {8n(2) {exp [—6W(a)]} 2 
(exp ]- 877 )2([( di)? اكد‎ 


Finding a stationary value for the right-hand side of Eq. (11-13) 
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requires simply that 
ðE, — 06 = 0 (11-30) 
which will be true if we take 
W(z) = V@) (11-31) 


This, in turn, implies that ô is 0 and that F’ has the same form as the 
classical free energy defined in Eq. (11-17). However, the potential, 
in the definition of F’, has been replaced by 1)2(. That is, 


exp (—BE%) = J5 f (exp ]-87)2([( 2 (11-32) 


where V (æ), the effective classical potential, is given by Eq. (11-24). 
For large values of 8, the free energy is essentially the same as the low- 
est energy level Eo; thus we can interpret Eq. (11-32) as providing an 
approximation to Ho. This means that the variational approach has 
produced the same result as that obtained in Chap. 10 and shown in 
Eqs. (10-67) and (10-68). 


THE STANDARD VARIATIONAL PRINCIPLE 


There is in quantum mechanics a standard variational principle, called 
Rayleigh-Ritz method, which is this: If H is the hamiltonian of the 
system, whose lowest energy value is Ho, then with f representing any 
arbitrary function 


Eo < Jf*Hf d(vol) [Jf*f d(vol)} (11-33) 


This has wide application and is very easily demonstrated. If the 
function f be expanded as a series in the eigenfunction ¢, belonging to 
the hamiltonian, i.e., if f = Zanġn, it is evident that 


[FH avon | f rs d(vol) | = لأ‎ lanl*En [2 lanl? | (11-34) 


n 


This latter expression is an average of the energy values (with positive 
weights |a,|?) which therefore exceeds (or equals) the least value Eo. 
The principle expressed in Eq. (11-33) has characteristics similar to 
the principle of Eq. (11-13). In fact, Eq. (11-33) is a special case of 
Eq. (11-13). (To be more precise, we should restrict this conclusion 
to those cases for which the hamiltonian H is derived from a lagrangian 
which does not contain any magnetic field. Under this restriction, 
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then, the conclusion holds.) To see the relation between these two 
equations, we shall consider the following example: 
Suppose the action S is connected with a lagrangian such as 


L = Wmi? — V(x) (11-35) 


where V(x) is independent of t. (Otherwise, of course, there are no 
fixed energy levels to seek!) We shall limit ourselves to the case of a 
single variable x, but the general case follows directly. We note here 
that if the lagrangian contains the term «A—for example, if the 
lagrangian represents a particle in a magnetic field—then Eq. (11-33) 
is still correct. However, the action S is complex. In this case we 
suspect that Eq. (11-13) (or some simple modification of this equation) 
is still valid. However, this has not been proved. So, for the present 
we shall limit our discussion to a case in which no magnetic field is 
present. Then in the limit for large values of 8 we have 


f 6 8 ip 0 imi? dt + 4 اماما‎ at} ) لسع ~ مه‎ )11-36( 


Now suppose we use for our trial action S’ the form 


s= [ ° ma? dt — fE VEO at (11-37) 


which involves some other potential V’(z). This means that 


egos K {V'[e(D] — Viz(D]} dt (11-38) 


or 


wh TP ; =< 
eas 1 ef if {Viz(0] — Viel} at ae | f e )مد‎ |” 
(11-39) 
If we were to define the mean value of any function which depends on 
the path z(t) in such a manner as this, we would find that the value is 
nearly independent of ¢ so long as i was not too close to either 0 or 8. 
Therefore, to a sufficient approximation, we can write 
ô = —fe%{V[x] — V'[z]} De® [fe Dx} 
= (Vix()] = V'le()}) (11-40) 


Following the methods given in Chap. 2, we can evaluate this path 
integral if we assume that the functions ين‎ and EZ’, belonging to the S’ 
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are known. If our path goes from zı to æ», for example, we obtain 
(fle(]) = X {exp [= (8 — DE41} [exp (tE), (2) (fam 
X {X lexp )- ,8019م‎ (zof (e) }7* (11-41) 


R 
where 


Jam = So (x)f(a) €) dz (11-42) 


But, if 8 approaches infinity and ¢ is likewise large (for example, 
t= 8/2), all the higher exponentials are negligible compared to the 
exponential involving the lowest-energy term Hj. Thus in the limit 


(f) = foo (11-43)‏ د 
This result can be written as‏ 
dx (11-44)‏ وه da + fp V(r)‏ و9 fo V (x)‏ — = 


Of course, we must subtract this value from Hj. However, if H’ is the 
hamiltonian associated with S’, that is, if 


H!' = 2- 4+ V"(2) (11-45) 

then 

H'$o = 0 (11-46) 

so that 

E, — ô = foo H'a dz + foo V pode — foo V' ون‎ dz (11-47) 

But the true hamiltonian can be written as 
-Ż4yv-Ż4v+v-V-HE+Vv-v (11-48) 

m 2m 


and this means that 
Eo < [o*Ho, dz (11-49) 


where ¢; is normalized and is the wave function corresponding to the 
lowest energy state of the hamiltonian given in Eq. (11-45). We note 
that the estimate of the lowest energy level, as given in Eq. (11-49), 
involves the arbitrary potential V’(x) only through the wave function 
.زب‎ Since this potential was arbitrary, so is the wave function ġo. 


„ Therefore, instead of choosing an arbitrary potential and finding 


from it the resulting wave function and then proceeding to evaluate 
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Eq. (11-49), we could instead pick the wave function itself and then 
evaluate Eq. (11-49) without ever bothering about the potential to 
which this arbitrary wave function belongs. The variable function in 
this process is then the wave function رو‎ rather than the potential 
function V’(z). We find, then, that this result is simply another way 
of stating the result of Eq. (11-33). 

If the problems such as the one given in this example were the only 
ones in which the concept expressed in Eq. (11-13) were useful, then 
there would not be much point to this long discussion. But there are 
much more complicated integrals for which Eq. (11-13) can be used in 
a way that, at least as far as we can tell, is not so easily transformable 
into Eq. (11-33). We shall describe such an example in the next 
section. 


SLOW ELECTRONS IN A POLAR CRYSTAL? 


We imagine an electron moving in a polar crystal, such as sodium 
chloride. The electron interacts with ions, which are not rigidly fixed. 
Thus, the electron creates in its neighborhood a distortion of the crystal 
lattice, and if the electron moves about, the region of distortion moves 
with it. This electron, together with its distorted environment, has 
been called a polaron. 

One consequence of the distortion of the lattice is that the energy 
of the electron is lowered. Furthermore, since as the electron moves 
the ions must move to adjust the distortions, the effective inertia of the 
electron (or to use the currently accepted term, the mass of the polaron) 
is higher than simply the mass the electron would obtain if the lattice 
were composed of rigidly fixed points. The precise motion of such a 
polaron analyzed quantum-mechanically is exceedingly complicated. 
We shall, however, make a number of approximations whose justifica- 
tion in the real case may be quite difficult. Nevertheless, we shall 
arrive at an idealized problem which has been studied by a number of 
physicists. It has been studied not only because of its possible con- 
nection with the real behavior of an electron and a crystal, but also 
because it represents one of the simplest examples of the interaction 
of a particle and a field. The path integral variational method has 
been very successful in the solution of this idealized problem. 


1R. P. Feynman, Slow Electrons in a Polar Crystal, Phys. Rev., vol. 97, p. 660, 
1955. 

? For example, H. Frohlich writing in Advanc. Phys., vol. 3, p. 325, 1954. 
References to other works are given in this article. 
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First, we note that even if the ions were rigidly fixed in the crystal, 
the electron would still move in a very complicated potential. In 
such a case, one can show that there are solutions of the Schrédinger 
equation for the electron with characteristic wave numbers k. The 
energy levels of these solutions are generally very complicated func- 
tions of the wave number. Nevertheless, we assume that the relation 
between the energy E and the wave number k is still a quadratic form, 
such as 


hk? 
= (11-50) 
where m is a constant (not necessarily the mass of an electron in a 
vacuum). Next, we note that the force which the electron exerts on 
the lattice is such as to push away the negative ions and attract the 
positive ions. The motion of these ions will be analyzed by consider- 
ing them as a set of harmonic oscillators and employing the methods of 
Chap. 8. However, we shall assume that the only harmonic modes 
which we need are those with high frequency, in which ions of opposite 
sign of charge move in opposite directions. The frequency wx of each 
mode then really depends on the wave number k of the mode. How- 
ever, we shall neglect this dependence and assume that w is a constant. 

Our object is to find the electrical force generated by a distortion 
characterized by the wave number k and find the interaction of the 
electron in this force. Here, we neglect the atomic structure and 
treat the material of our crystal as simply a continuous dielectric 
which carries waves of polarization. If P is the polarization, written 
in the form of a longitudinal wave 


eas T ae" (11-51) 


then the charge density from the ions is 

p= V.P = kape" (11-52) 
If the potential is V, we have 

VV =p (11-53) 


Thus if q, is the amplitude of the kth longitudinal running wave, the 
polarization ax is proportional to gx, and the interaction between the 
wave of polarization and the electron is proportional to the sum over 
all values of k of the terms (q/k)e™**. 

Since the energy and the momentum of the electron are related by 
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the expression E = p?/2m, we can write the lagrangian of the entire 
system as 


% 
L=5 Ti J 32 (wage JET SS J j mer (11-54) 
k 


The first term of the expression is the energy of the electron in 
a rigid lattice, where r is its position. The second term represents the 
lagrangian of the oscillations of polarizations taken alone, where it is 
assumed that all waves of polarization have the same frequency and the 
coordinate of the kth mode is q. The last term is the lagrangian of the 
interaction between the electron and the lattice vibrations, where V in 
this expression represents the volume of the crystal and g is a constant. 
To simplify writing of all our subsequent formulas, we have written 
this in dimensionless form. That is, the scales of energy, length, and 
time are so chosen that not only 7 but also the common frequency w 
of the oscillators and the mass m of the electron are all unity. The 
coupling constant a is then the dimensionless ratio 


eee) 01-55 


where e and e. are the static and high-frequency dielectric constants, 
respectively. In a typical case, such as the crystal of NaCl, the value 
of amay beabout 5. The values of energy which we shall calculate are 
in units of hw. 

Now we can study the quantum-mechanical motion of the electron, 
solving the motion of the harmonic oscillators completely. For exam- 
ple, the amplitude that the electron starts at rı with the oscillators 
in the ground state and ends at the position rs with the oscillators 
still in the ground state is 


Goo(2,1) = fets 22) (11-56) 
where we have used the results of Chap. 8, and 


dk 
(2r)? 


If we a the ل‎ over the wave numbers k, we find 


"dt + JA va TA eik-r(e—ik-r(s)e—ilt—s| di ds -< 


(11-57) 


AAT سا‎ Sal FO" d: (11-58) 


This quantity Goo(2,1) will depend upon the initial and final positions 
of the electron, rı and rz, and upon the time interval we are considering, 
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T. Since this function is a kernel, it is a solution of the Schrödinger 
wave equation, considered as a function of the time interval T. 
Therefore, we realize that it will contain frequencies in its exponentials 
which are proportional to the energy levels Em. It is the lowest one 
of these energy levels which we now seek. 

In developing our variational principle, as we have explained, 
we are not interested in the kernel for real time intervals T. Instead, 
we want quantities such as those which appear in Eq. (11-8) for large 
values of 8. By following all the steps leading to Eq. (11-58), it can 
be readily shown for imaginary values of the time variable 8 that the 
resulting kernel has the form 


K(2,1) = fe’ Dr(t) (11-59) 


where the variable t goes from 0 to 8 and 
e7 lt—sl 


1 a 
f= al at + جک‎ || FO — r] ف‎ )11-60( 


This result is just that which one might expect from the replacement 
of t by the imaginary value zt in Eq. (11-58). Asymptotically, for 
large values of 8, this kernel becomes proportional to ,جع‎ 

We now have a relatively complicated path integral on which to try 
our variational principle. Next, we shall have to choose some simple 
action S’, which roughly approximates the true action S, and then find 
E’ and ô. 

We note that in Eq. (11-60) the particle considered at any particular 
time! “interacts” with its position at a past time by a reaction which 
is inversely proportional to the distance traveled between these two 
times and which dies out exponentially with the time difference. The 
reason for this is that the disturbance set up by the electron in the 
crystal lattice in the past takes some time to die out. That is, it 
takes some time for the ions to relax, and here in this relaxation period 
the electron still “feels” the old disturbance. 

We shall try an action S’ which has this same property, except 
that instead of involving the inverse distance as a coupling law, the 
attraction will have the geometric form of the parabolic well. This 
‘would be a poor approximation if the distance |r(s) — r(t)| could very 
often become exceedingly large. However, since there is a limited 
time available before the exponential time factor forces the interaction 
to die out, large values of this difference will not make any important 


dr |? 
dt 


1 Although ¢ in Eq. (11-60) is not really the time, but an integration variable 
instead, it is useful to think about it as a time, just as we did in Sec. 10-2. 
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contribution to the integral. Thus, we shall try 
S' = -14[ |t|? dt — WCSfir() — r(s)|2e-w!*! dt ds (11-61) 


The constant C is a measure of the strength of the attraction 
between the electron and the previously created disturbance. We take 
this as an adjustable parameter. ` Furthermore, we can with no extra 
difficulty permit the exponential cutoff law to contain the adjustable 
constant w, which can be different from unity. With this extra 
parameter we can partly compensate for the imperfection which we 
have introduced by replacing the inverted distance effect by a parabolic 
effect. [We also note in this regard that adding an extra constant 
to the parabolic term |r(¢) — r(s)|? leads to no further freedom, since 
such a term would drop out in evaluating a formula for Ej.] We 
shall adjust variable parameters C and w later in the evaluation in 
order to make Ej a minimum. 

Since the action S’ we have picked is quadratic, all of the path 
integrals which result are easily worked out by the methods described 


in Chap. 2. 

By comparing Eqs. en and (11-61), we find that 
S Seas S^) N l—si 7 

5 BG ik (CORSON, ‘veil. 


+40 f (le — r(s)[ ell ds 
=A+B (11-62) 


We shall concentrate our attention on the first term on the right- 
hand side of this equation, A. In this term we can express |r(t) — 
r(s)|-! by a Fourier transform. As a matter of fact, this term is the 
result of the Fourier transform involved in the step between Eqs. 
(11-57) and (11-58). So we have 


Ir(t) — r(s)|-! = f d'k exp {zk ٠ [r(t) — r(s)](242k?)—}} (11-63) 
For this reason we need to study 
Ss! ik . [r(r) — r(e 
(exp (ik [r() — r(o)]}) = Le ep te) لام‎ Dr 
(11-64) 
The integral in the numerator is of the form 


(e |G) 


dhe 30 ff O = rOet dt ds 


f(t) r() a|) Dr(t) (11-65) 
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where specifically 
f(t) = ikô(t — r) - ikô(t — o) (11-66) 


Now we shall evaluate Eq. (11-65) in so far as it depends on f or k aside 
from a normalization factor which drops out in Eq. (11-64). Inci- 
dentally, let us notice that the three rectangular components separate 
in Eq. (11-65) and we need consider only a scalar case. The method of 
integration is the same as that introduced in Chap. 2 for the evaluation 
of gaussian path integrals. Thus we substitute X(t) = X’(t) + Y(O, 
where X’(t) is that special function for which the exponent is maxi- 
mum. The variable of integration is now Y(t). Since the exponent 
is quadratic in X(t) and X’ renders it an extremum, it can contain 
Y (t) only quadratically; so Y then separates off as a factor not contain- 
ing f, which may be integrated to give an unimportant constant 
(depending on T only). Therefore, within such a constant 


I = exp [— 46JX"(t) dt — WCSI[X’D — X"(s)PPeel- dt ds 
+ FOX’ (Ð dt] (11-67) 


where X’ is that function which minimizes the expression [subject for 
convenience to the boundary condition X’(0) = X’(T) = 0 if the 
time interval is 0 to T]. The variational problem gives the integral 


equation 

PXO L 20 f [XO — Xe“! as — f(D (11-68) 
Using Eq. (11-68), Eq. (11-67) can be simplified to 

I = exp [JwfOX'( dt] (11-69) 


We need merely solve Eq. (11-68) and substitute into Eq. (11-69). 
To do this, we define 


2 = 5 Í elel X’ (s) ds (11-70) 
so that 
PZW = wiz) - X'0) (11-71) 


while Eq. (11-68) is 


PX _ 40 
UH 


di = AX’ — 201 - f0 (11-72) 
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The equations are readily separated and solved. The solution for 
X’(t) substituted into Eq. (11-69) gives, for the case of Eq. (11-68), 


= exp {tk- [X() — X@)]} 


2Ck? 2 ١ 
= exp |- 1 ان‎ ei 33 ker = a| (11-73) 


where we have made the substitution 


vw 1 se (11-74) 


The result is correctly normalized, since it is valid fork = 0. Upon 
substitution from Eq. (11-73) into Eq. (11-63) there results an integral 
on k which is a simple gaussian, so that substitution into A gives 


7 ES Ay] 14 
Ase rZ ; | wr + CEE | e~ dr (11-75) 


To find B, we need {|r(t) — r(s)|?). This can be obtained by expanding 
both sides of Eq. (1 oa with respect to k up to order k?. Therefore 


1 
3 (Ir) — r(@)|?) = = (1 area. eal (11-76) 
The integral in B is now easily performed and the expression simplifies 
to 

3C 
B oir (11-77) 


‘Finally we need E’, the energy belonging to our action S’. This is 


most easily shed by differentiating both sides of Eq. (11- a with 


respect to C. One finds immediately 


C dE: 
<a = B (11-78) 


so that, in view of Eqs. (11-77) and (11-74), integration gives 


Ey = 40 - w) (11-79) 


since Hy = 0 for C = 0. Since Eû — B = (3/4v)(v — w)?2, we finally 
obtain for our energy expression 


7 žo REET (11-80) 
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with A given in Eq. (11-75). The quantities v and w can be con- 
sidered as two parameters which may be varied separately to obtain a 
minimum. 

The integral in A, unfortunately, cannot be performed in closed 
form, so that a complete determination of 77 requires numerical inte- 
gration. It is, however, possible to obtain approximate expressions 
in various limiting cases. The case of large a corresponds to large v. 
The choice w = 0 leads to an integral 


Shere ا د‎ al(1/v) 
A = Hanis f 0 de 1 درس ب‎ a )11-81( 


and Ey = 30/4. It corresponds to the use of a fixed harmonic binding 
potential in Eq. (11-37). For large v, e-** can be neglected, so that 
A = (rw) “av. For a less than 5.8 and w = 0, Eq. (11-80) does not 
give a minimum unless v = 0, so that the w = 0 case does not give a 
single expression for all ranges of a. In spite of this disadvantage, the 
result with Eq. (11-81) is relatively simple and fairly accurate. For 
a > 6, only fairly large values of v are important, and the asymptotic 
formula (good to 1 per cent for v > 4) 


Ante 9j (1 oo 2 )11-82( 
T v 


is convenient. Frohlich, however, considers the discontinuity at 
a = 6 as a serious disadvantage—a disadvantage which can be avoided 
in our present approach. This we do by choosing w different from 
zero. 

Let us study Eq. (11-80) just for small a, in case w is not zero. The 
` minimum will occur for v near w. Therefore, we write v = (1 + ew, 
consider e small, and expand the root in Eq. (11-80). This gives 


9 0 
A= |: =e / د‎ a fein a | (11-83) 
The integral is 
2w-[(1 + w)* — 1] =P (11-84) 


The problem of Eq. (11-80) then corresponds, in this order, to mini- 
mizing 


E hed ay ei (BaP) (11-85) 
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That is, 


a 2a(1 — P) 


37 )11-86( 


which is valid for small a only, because e was assumed small. The 
resulting energy is 
ا ا‎ E 


E= a متحت‎ (11-87) 


Our method therefore gives a correction even for small a. It is least 
for w = 3, in which case it gives 


IS RA tne © )11-88(‏ قر بقار 


It is not sensitive to the choice of w. For example, for w = 1 the 1.23 
falls only to 0.98. The method of Lee and Pines! gives exactly the 
result of Eq. (11-88) to this order. The perturbation expansion has 
been carried to second order by Haga,? who shows that the exact 
coefficient of the (a/10)? term should be 1.26, so that our variational 
method is remarkably accurate for small a. 

The opposite extreme of a large œ corresponds to large v and, as 
we shall see, w near 1. Since v < w, the integral in Eq. (11-76) 
reduces in the first approximation to Eq. (11-81), which we can use in 
its asymptotic form. The next approximation in w can be obtained 
by expanding the radical in Eq. (11-75), considering w/v >> 1. Fur- 
thermore, e~* is negligible. In this way we get 


3 A 2 n2  w? 


This is minimum, within our approximation of large v, when w = 1 and 
v = (4a?/9r) — (41n? — 1). Then we find? 


S EN, Za 3 s oe 4 
R= 37 5 112 4 0.1061la 2.83 (11-90) 


1T. Lee and D. Pines, Interaction of a Nonrelativistic Particle with a Scalar 
Field with Application to Slow Electrons in Polar Crystals, Phys. Rev., vol. 92, 
p. 883, 1953. 

2E. Haga, Note on the Slow Electrons in a Polar Crystal, Progr. Theoret. Phys. 
(Kyoto), vol. 11, p. 449, 1954. 

3S8. I. Pekar in Theory of Polarons, Zh. Eksperim. i Teor. Fiz., vol. 19, p. 796, 
1949, has shown that E goes as —0.1088qa? for the case of large a. 
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TABLE 11-1 
a 3 11 
v 3.44 9.85 15:9 
w 2.55 1.28 1.15 
E; —3. 1333 —11.486 —15.710 
Eip —3.10 — 9.95 — 12.41 
E, —3.09 — 9.65 —11.88 
Ene —10.31 —14.7 


The approximations do not keep E as an upper limit because, unfor- 
tunately, the further terms, of order 1/a*, are probably positive. 

Detailed and numerical work based on this approach has been carried 
out by T. D. Schultz.! Using a digital computer, Schultz worked out 
values of v and w which would give a minimum for several different 
values of a. He also evaluated E and compared it with the values 
which would be obtained from several alternative theories. In par- 
ticular, he worked out the self-energy from the theories of Lee, Low, 
and Pines (Ey1p),2 Lee and Pines (£1,),? Gross (E,),* and Pekar, 
Bogolubov,* and Tyablikov’ (Ept). 

The results for a, e, and w and also for the energies given by the 
Feynman theory (Z,) compared with energies derived from the other 
theories are given in Table 11-1, which is reproduced from the paper of 
Schultz. In this table, both w and 7 are assumed to have the value 1. 
Note that for all values of a, the value of EH; is less than all others. 


1T. D. Schultz, Slow Electrons and Polar Crystals: Self-energy, Mass, and 
Mobility, Phys. Rev., vol. 116, p. 526, 1959. 

2Lee, Low, and Pines, The Motion of Slow Electrons in a Polar Crystal, 
Phys. Rev., vol. 90, p. 297, 1953. 

3 Op. cit. 

4E. P. Gross, Small Oscillation Theory of the Interaction of a Particle and a 
Scalar Field, Phys. Rev., vol. 100, p. 1571, 1955. 

5S. I. Pekar, “Untersuchungen über die Elektronentheorie der Kristalle,” 
Akademie-Verlag, Berlin, 1954. 

6 S. I. Bogolubov, On a New Form of the Adiabatic Theory of Disturbances in 
the Problem of Interaction of Particles with a Quantum Field, Ukr. Mat. Zh., 
vol. 2, p. 3, 1950. 

7§. V. Tyablikov, An Adiabatic Form of Perturbation Theory in the Problem 
of the Interaction of a Particle with a Quantum Field, Zh. Exsperim. i Teor. Fiz., 
vol. 21, p. 377, 1951. 
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Other Problems in 
Probability 


In the preceding chapters we have seen how to use path integrals to 
treat a number of quantum-mechanical problems which are, by their 
very physical nature, probabilistic problems. We have also used the 
path integral method to analyze some aspects of statistical mechanics 
wherein the probabilistic nature of the functions permitted the path 
integral technique to be particularly effective. We can continue this 
line of development into a wide variety of probability problems where 
this approach has special and valuable applications. 

It is the purpose of this chapter to explore a number of these proba- 
bility problems. They will be of two kinds. First we shall discuss 
direct applications of path integral ideas to classical probability prob- 
lems. This is quite different from all preceding chapters, in which all 
applications were to quantum mechanics. Following that, we shall 
deal with problems in which both probability and quantum mechanics 
are involved. We cannot, in this chapter, deal with these matters in 
any detail. We shall only outline by some examples how certain 
problems may be set up and thereby suggest to the reader other appli- 
cations of the path integral approach. 

The main direct application of path integrals to probability problems 
is due to the ability of path integrals to deal directly with the notion 
of the probability of a path or function. To make this idea clear, we 
proceed in steps from the well-known! ideas of probability applied to 
discrete events and to continuous variables. 


RANDOM PULSES 


To start with, suppose we consider a typical probability problem for a 
discrete variable. Weare given a situation in which a series of discrete 
events is taking place at random times, e.g., cosmic rays striking a 
detector or raindrops falling on a specifically demarked area of ground. 
We know that the particles fall at random times, but in any long period 
of time T we expect ñ = Ty particles will be observed. In other words, 
u is the mean counting rate. 

Of course, in any actual measurement the exact number of particles 
n recorded will not, in general, correspond to the expected number. 
But we can ask directly: What is the probability of observing a particu- 
lar number n of particles during a period when the expected number of 
particles is 7? It is given by the Poisson distribution 

n” 


Ea = nle” (12-1) 


1 Harold Cramér, “Mathematical Methods of Statistics,” Princeton University 
Press, Princeton, N.J., 1951. We assume knowledge of usual probability theory. 
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On the other hand, we might ask a probability question of a different 
kind. We might, for example, ask: What is the probability that the 
interval from one particle impact to the next will be some particular 
time t? Actually, there is no correct answer to the question phrased 
this way. If we were to ask the probability that the time interval will 
be equal to or greater than t, then we could give an answer [it is 
exp (—uyt)]. That is, we can get an answer to a question about t falling 
within a certain range. Thus, if we are interested in a particular value, 
we must allow ourselves an infinitesimal range and ask the question: 
What is the (infinitesimal) probability that the time interval will fall 
within the range dt centered around t? The answer is written as 


P(t) dt = pe~ dt (12-2) 


So we create a concept of a probability distribution of a continuous 
variable: P(t) is the probability per unit range of t that the interval is 
t. We write the probability distribution of ند‎ as P(x) if P(x) dz is the 
probability that the variable lies in the range dz about 2. We can 
easily extend this to two variables and write the probability distribu- 
tion of x and y as P(x,y) dx dy. By this we mean that the probability 
of finding the variables z and y in the region R of the zy plane is given 


by ie P(z,y) dx dy. 

We wish to expand the concepts of probability still further. We 
want to consider the distribution not of single variables but of complete 
curves; i.e., we want to construct probability functions, or rather func- 
tionals, which will permit us to answer the question: What is the proba- 
bility of obtaining a particular time history of a physical phenomenon, 
such as the voltage in a resistor or the price of a commodity, or, in two 
variables, the probability of a certain shape of the surface of the sea 
as a function of latitude and longitude? Thus, we are led to consider 
the probability of a function. 

We shall write it down this way. The probability of observing the 
function f(t) is a functional 2]7)0([. But we must be careful to remem- 
ber that questions relating to such a probability have meaning only if 
we define the range within which we are looking for a specific curve. 
Just as in the example above we had to ask the question: What is the 
probability of finding the time interval within the range dt? so now 
we must ask: What is the probability of finding the function within 
some more or less restricted class of function, for example, those curves 
which are bounded between values a and b for the complete time history 
in which we are interested? If we call such a subset of curves the 
class A, then we ask: What is the probability of finding f(t) in the class 
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A? and we write the answer as the path integral 


fi PIO 70 (12-3) 


where the integral extends over all functions of class A. 

Actually, this expression can be.thought of as similar to the proba- 
bility function for a number of different variables. If we imagine time 
to be divided into discrete intervals (as we imagined it when we were 
first defining path integrals in Chap. 2) taking on the values of tı, ts, 
. . . , then the values of the function at those particular times f(t), 
f(t), . . . = fı, fo, . . . are analogous to the variables of a multi- 
variable distribution function. The probability of observing a particu- 
lar curve can then be thought of as the probability of obtaining a par- 
ticular set of values fi, fe, . . . in the range dfı, dfs, . . . , that is, 
P(fife, o .) dfidfe- ++. 

If we then proceed to the limit as the number of discrete intervals 
in time becomes infinite, we obtain the probability of observing the 
continuous curve f(t) in the range 27) as the integrand in the path 
integral of Eq. (12-3). It is this probability concept and this proba- 
bility functional with which we shall be working in the remainder of 
this chapter. 


CHARACTERISTIC FUNCTIONS 


It is helpful to continue using the analogy between the probability 
functional of a path and the more traditional probability function of a 
variable. A number of concepts, such as the concept of a mean value, 
are common to the two approaches. With usual probability distribu- 
tions for quantities which have discrete values, so that the probability 
of observing the specific number n is Pa, the mean is 


8 


nPq = 7 )12-4( 


For a continuously distributed variable, it is 


fr: xP(2) de = & . (12-5) 
and in an analogous fashion, the mean value of the functional QIO] 
is written 


SQUOIPIO] DO _ 
-IPVO  ~ ا‎ 
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In this last equation, as in Chap. 7, we have included a path integral 
in the denominator to remind ourselves that we are always faced with 
normalizing problems. In principle, it would be possible to work out 
the path integral of the distribution function, set it equal to 1, and so 
evaluate the normalizing constant to begin with. However, in many 
practical cases it is more convenient to leave the function unnormalized 
and simply cancel out factors on the top and bottom of the expression 
which might, in actuality, be extremely difficult to evaluate. 

Just as the mean value of the function can be expressed in the path 
integral notation, so can the mean-square value of the function at a 
particular time, say, = a. Thus, 


Mra = OPPO OIC een DE 0 | (12-7) 


for this is only a special functional. 

One of the most important mean values of a function, as evaluated 
with Eq. (12-5), is the mean of ez. It is called the characteristic 
function, and it is 


p(k) = (ez) = ‘phe e*P(2) dx (12-8) 


This is sometimes also called the moment-generating function. It is 
simply the Fourier transform of P(x), and it is an extremely useful 
function for evaluating various characteristics of the distribution, since 
it is equivalent to a knowledge of the distribution function itself. This 
last fact is the result of the possibility of performing the inverse trans- 
form as 


Plaj = Vie و عسي‎ )1:( dk (12-9) 


A number of important parameters of the distribution can be deter- 
mined by taking the derivatives of the characteristic function. Thus, 
for example, the mean value of ند‎ is 


(a) = — 8H 


as is readily demonstrated by differentiating each expression in Eq. 
(12-8) with respect to k and then setting k = 0. In fact, a series of 
such relations exists: 


0 SU SOSA) كا "ان‎ Gy +: (12-11) 


Of course, our next step is to generalize the concept of the character- 
istic function to the functional distribution case. We could construct 


(12-10) 


k=0 
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a mathematical definition of such a characteristic function by returning 
again to our picture of discrete time intervals. We then wish to per- 
form the Fourier transform on the probability function of a large 
number of variables, using the kernel exp (ikıfı) exp (ikafa) ٠ ° ° . 
As we go to the limit of an infinite number of time intervals, this 
becomes simply e/*®/ æ, This, then, is the functional whose mean 
value we wish to take in order to develop the characteristic functional. 
By using Eq. (12-6), we obtain 


: ظ HPF] OF)‏ ممصو _ 
ل سن بمج = BRO]‏ 


This characteristic functional also has important special properties. 
For example, 6(0) = 1, and the mean value of the function f(t) evalu- 
ated at the particular time 4 = a is 
6 
= —1——~ Okit 12-13 

(f) = -iga MRO), (12-13) 
where we have used the technique of the functional derivative as 
described in Sec. 7-2. 

In principle, we can invert our path integral Fourier transform and 
write the probability functional as 


PIS] = fetoro #B[k(t)] Dk() (12-14) 


where now, of course, the path integral is carried out in the space of the 
k functions. 

We may remark, for use in interpretation later on, that if the func- 
tion f(t) is not uncertain but is definitely known to be some special 
function F(t), that is, P[f(t)] is zero for all f(t) except f(t) = F(t), then 
the characteristic function is 


= gilk(t)F() at (12-15) 


NOISE 


Suppose we apply the ideas so far developed to a particular example 
and in the process develop a few more concepts. Let us consider the 
situation in which we are counting some sort of pulses, perhaps pulses 
generated by the impact of cosmic rays on a Geiger counter or perhaps 
thermal-noise pulses in a resistor. In such cases the pulses are not 
simply discrete spikes of energy but are represented by a rising and 
falling voltage. Thus, careful inspection of the actual voltage history 


326 


12-3 


Other problems in probability 
327 


associated with such a pulse would show that it has the form g(t) fora 
pulse occurring at t = 0. So, if the pulse occurred at to, the shape of 
the voltage curve would be g(t — to). 

Now, suppose we conduct our counting experiment for the time 
interval of length 7 during which a number of pulses centered on the 
times fı, t2, . . . ,f, occurred. The complete voltage history over this 


experiment would be 0 g(t — t;). Since we know when all the events 
=1 


occurred, our probability function would simply be the representation 
of certainty, and by use of Eq. (12-15) the corresponding characteristic 
function becomes 


& = exp [5 | Ost- t) dt) - (12-16) 


But now suppose that we wish to determine the probability of find- 
ing a particular time history for the voltage before conducting the 
experiment. In that case we permit the n events to be randomly dis- 
tributed with uniform probability over the complete time interval. 
That is, the probability of an event happening within the time interval 
dt is dt/T. In this case the characteristic functional becomes 


2 ax = dti dt tn 
nie I, exp 2 k(t)g(t — t) alos F 


= 58 eisk(t+s) g(t) dt FY (12-17) 


We call the expression in parentheses A and write this result as A”. 

If the number of events in the time interval is distributed in such a 
way that the Poisson distribution applies, i.e., the occurrence of each 
event is independent of the time of occurrence of any other event and 
there is a constant rate u for the expected number of events per unit 
time, then the expected number of events in the time interval T is 
„T = ñ. The characteristic function is 


ñr _ : 
s=) Ane (12-18) 


The sum on the right-hand side of this equation is the expansion of an 
exponential function, so that we can write the characteristic function as 


e-G-A)a = exp | —pT (1 =a Ja etsk(t+s) g(t) dt | 


exp [ = 8 (1 — efSkete0@ dt) ds | )12-19( 
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Thus, we may determine the characteristic function for many different 
situations. We next go on to discuss this result under various approxi- 
mate circumstances. 

Suppose we imagine that the pulses get very weak while the expected 
number of pulses per unit time, that is, u, becomes large. In that case 
g(t) is small, so we can expand e‘/*¢+#)0® dt in a power series and we 
can approximate the characteristic function as 


exp | tu ik F k(t + 990) dt ds | = exp | inG ie 00 at | (12-20) 


where we have used the substitution G = fg(¢) dt, the area of the pulse. 
This means that © is in the form of Eq. (12-15) with F(t) = „G (a con- 
stant independent of t). That is equivalent to saying that f(t) is 
certainly بر‎ or, in other words, that there is unit probability for 
observing the function f(t) = „G and zero probability for observing 
any other f(t). That is to say, the pile-up of a large number of small 
pulses generates a nearly steady direct voltage of value equal to the 
number of pulses per second بر‎ times the average voltage © supplied by 
each. Next, we go to one higher approximation and study the fluctua- 
tions or irregularities of this nearly direct voltage. 

Equation (12-20) is a first-order approximation to the exponential 
eiskt+s)0® d in the description of the characteristic functional of Eq. 
(12-19). Suppose now that we go on to the next-order approximation 
and include the second-order term. This is 


-4 f Í k(t)g(t + s) dt f k()g(t + s) dt! ds (12-21) 


To simplify this expression, we define a function which measures the 
overlap between two nearby pulses as 


A(r) = fg(Dg(t + r) dt (12-22) 


By use of this substitution, the second-order term is reduced to 


=E / “ jk "KOKOA — t’) dt dt! | (12-23) 


Including both first- and second-order terms, the characteristic 
functional is 


DP = EtGSk(t) dte—(u/2) f SKOKA) dt dt’ (12-24) 


The first factor in this expression is the constant average level, which 
we might call the d-c level if we are thinking about voltage pulses. 
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We can, if we wish, neglect this level and concentrate only on the 
variations around it by shifting the origin of f(t). That is, we can 
always take out a factor e/*®?® & by shifting the origin of f(t) [i.e., by 
writing f(t) = F(t) + f’( and studying the probability distribution of 
J‘ (t) and its characteristic functional]. If we make this change of origin, 
we are in a position to study the fluctuations of voltage around the 
d-c level., 

We note one special approximation to Eq. (12-24) which is often 
adequate. Generally, \(r) is a narrow function of r. The pulse shape 
g(t) rises and falls with a finite width, so if two pulses are spaced a 
very great distance apart, their overlapping area vanishes. This is 
another way of saying that (7) approaches 0 rapidly as r becomes 
large. As a result of this, if \(r) is narrow enough, the second term in 
Eq. (12-24) can be approximated by 


[k(t)]? dt (12-25)‏ [(2/و) سم 


“where we have used the substitution 90-4 i E Adr. This is equiv- 


alent to the probability distribution 
PIJĄ] = SuN? a (12-26) 


Such fluctuations as we are describing here are often called gaussian 
noise. 

Characteristics of distribution functionals describing noise functions 
have been studied extensively in recent years in the theory of communi- 
cations. A number of characteristics of noise spectra have been 
defined and evaluated, and we shall carry through similar discussions 

ı here and in the next section, where we treat gaussian noise. 

Now we shall continue to show, by giving one further example, how 
characteristic functionals are set up. We shall consider pulses which 
come at random times all with a given characteristic shape, say, u(t), 
but each with a different scale height, so a typical pulse is written au(t). 
We might allow the height a to be either plus or minus. So now we 
suppose the timings of the pulses are randomly spaced instants ¢; and 
the heights take on random positive and negative values a; The 
resulting function is 


JO = X )مره‎ — t) (12-27) 


` If first we set aside the random nature of the events, we obtain a char- 


Quantum mechanics and path integrals 


acteristic functional equivalent to that of Eq. (12-16) as 
© = exp D a; f k(thu(t — t) at] (12-28) 
7 


Next, if we include the presumed random nature of the scale heights 
of the pulses and say that the probability of obtaining a particular 
scale height a; for the jth pulse in the region da; is p(a;) da;, then the 
characteristic functional becomes 


a = ff °° exp [iD a f kOult - t) at] pla) dar pas) das ٠ ° - 
(12-29) 


Of course, each of these probability functions for the values of a; has 
associated with it a characteristic function or moment-generating func- 
tion. Suppose we call this function W[w] and define it as 


Ww] = fe*p(a) da (12-80) 


Then the expression for can be written as 


ð = II W [ / kult — t) dt | (12-31) 


Now we can proceed as we did for the derivation of Eq. (12-17) and 
introduce the notion that the exact time at which a pulse occurs is 
randomly distributed with a uniform distribution function over the 
interval 0 > 1 < T. If we suppose that there are precisely n pulses in 
this interval, the characteristic functional becomes 


a (3) (12-32) 
where 
y = JWifk@®ut — s) dt] ds (12-33) 


If again we assume, as we did in the derivation of Eq. (12-18), that 
the distribution of the times of occurrences of the pulses fits the assump- 
tions of the Poisson distribution, then we must multiply the expression 
of Eq. (12-32) by the factor i*e-*/n!, where, as before, ñ = uT is the 
expected number in the time interval T. We then sum on ^ to get 


e-H(T—7) 


exp (—uf{1 — W[fk()u(t — s) dt]} ds) (12-34) 


® 
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As a special example of this result, we assume that the pulse shape is 
extremely narrow. In fact, we assume that we can approximate the 
shape function by a delta function, that is, u(t) = 6(t). Then the 
characteristic functional is 


® = exp (—uf{1 — W[k(s)]} ds) (12-35) 


Next, we assume that the distribution of scale heights is gaussian with 
zero mean and a root-mean-square value of ø; in other words, the ordi- 
nary normal distribution is given by 


p(a) da = wie eds (12-36) 
TCO 


In that case, the characteristic function W[w] becomes 


Wio] = e-2 (12-37) 
and for © there results 
@[k(t)] = exp [~uf (1 — e-1) ds] (12-38) 


So we find again that a characteristic functional #[k(t)] can be derived 
to fit our assumed conditions. At any stage in this derivation, approxi- 
mations that would reduce this to a quadratic form may be valid. 
For example, in the case just described a small value of the root-mean- 
square scale height c corresponds to weak signals. If, at the same time, 
the expected number of signals arriving in a time interval is not small, 
then Eq. (12-38) can be approximated quite well by 


& = exp {- / [kù] 1 )12-39( 


A distribution like this is called white noise. 


GAUSSIAN NOISE 


The type of distribution whose characteristic functional is gaussian 
comes up in many situations, and we shall discuss it here. 

We have been working with probability distributions which are 
gaussian, i.e., exponentials of second order in the defining functions. 
Although we arrived at this gaussian functional by making a second- 
order approximation to the exponential term introduced by our 
assumption of a Poisson distribution of random pulses, it is worth 
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remarking that a number of physical processes actually seem so dis- 
tributed by their nature. In traditional probability theory the normal, 
or gaussian, distribution fits physical phenomena which are the result 
of the combination of a large number of independent events occurring 
randomly. This is the conclusion of the central-limit theorem of 
probability theory." The same conclusion applies to distribution 
functionals and results in the fact that many important cases for study 
of physical phenomena have gaussian distributions. For further ref- 
erence, we write here the most general form of a gaussian characteristic 
functional as و‎ 


P = CiskW F(t) dtg—Wf Skk) A (t,t) dt dt’ (12-40) 


The first factor in this expression can be removed by a shift of the 
origin defining f(t), as we discussed in deriving the distribution of 
fluctuations of voltage around a d-c level. Thus, we could define 
f! = f — F(t). Next we note that, if the system we are describing 
behaves in a manner independent of the absolute value of time, then 
the kernel A (t,t’) must have the form A(t — t’). 

In actual physical situations this particular function A may be 
defined by mechanisms in some sort of experimental situation or by 
approximating a particular piece of reality in such a way that it behaves 
nearly like the distribution function we are studying. We have an 
example of such an approximation in the derivations given above on the 
noise spectrum. For it, A(é,t’) = „A(t — t’). In either case theorems 
of the behavior of the system which result from the use of this function 
will be the same so long as the characteristic functional © can be suit- 
ably approximated by the quadratic or gaussian form of Eq. (12-40). 

Of course, by now we know how to deal with gaussian functionals, 
since we have spent quite a bit of time in the preceding chapters manip- 
ulating them in one way or another. In this particular case the appear- 
ance of the factor 7 is different from that in typical quantum-mechanical 
cases. This means that functions which were real in Sec. 7-4, for 
example, are imaginary here. However, this does not require any 
review of the mathematical aspects of the subject; it simply is an aware- 
ness of and preparation for certain differences in detail in the results. 

The probability distribution which corresponds to the characteristic 
functional of Eq. (12-40) is 


PISO] = exp {— SSO — POMC) - FU)IBGt).dt dt} (12-41) 


1 Ibid., pp. 213ff. 
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where now the function B(i,t') is a kernel reciprocal to A(t,t’). That 
is, the functions A and B are related by 


SA(t7)B(r,s) dr = ô(t — s) (12-42) 


Problem 12-1 Prove this. 


All the parameters of the distribution can be calculated from the 
characteristic functional by the methods introduced in Chap. 7. 

We shall now study in more detail some of the physical character- 
istics of gaussian noise that is time-independent; i.e., we shall study 
distributions whose characteristic functional is 


P = EmVISKORO AG) at ae’ (12-43) 


This function A(r) is called the correlation function. Equation 
(12-43) means that the probability of observing a particular noise 
function f(t) is 


PIFO] = HIII OIB) dt av’ (12-44) 


The function B appearing in this last expression is the inverse of the 
correlation function A. That is, [B(t — s)A(s) ds = ô(t), or, if 


E(w) = JA (r) dr (12-45) 


is the Fourier transform of A(r), the Fourier transform of B(r) is 
1/P(). 

We shall begin by calculating some of the properties of this distribu- 
tional. We first show that the average value of the noise signal van- 
ishes. That is because the average value of the noise function at a 
particular time ¢ = a is, as in Eq. (12-13), 


Ha) = -i (12-46) 


In this expression, the functional derivative of in Eq. (12-43) is 
given by (cf. Sec. 7-2) 


0% 
پک ا و‎ 12-47 
skla) [= [OAG - a) a] 8 (12-47) 
and, if it is evaluated for the particular function k(t) = 0, then it 
becomes 0. 
Next we calculate the average of the square of the noise function 
or, better, the expected value of the product of two noise functions at 
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times a and b. This is called the correlation function of the noise. 
It is [by differentiating both sides of Eq. (12-12) twice] 


8 - 46 - وير يو - = (MIO)‏ 


— [fk(t) A(t ح‎ a) 011[]])( AW’ — b) diJ® (12-48) 


and, if this is evaluated for the function k = 0, it is simply A(b — a). 
That is why A is called the correlation function. 


NOISE SPECTRUM 


A most useful characteristic of the noise distribution is the power 
spectrum of the noise (cf. Prob. 6-26), which is defined as the mean 
value of the square of the Fourier transform of the noise function, 
that is, the mean square of 


lw) = Sie dt (12-49) 
By using our previous results, we can evaluate this as 


(lelo) = ([f(a)e* da ff(b)e-#? db) 

JJ(f(a)f (0) ee da db 

= ffA(b — aje- da db 

= {P(w) da (12-50) 


Here we have made use of the function E(w), the Fourier transform of 
the correlation function A [cf. Eq. (12-45)]. 

If we carried out the integration defined in the last step of Eq. 
(12-50), we would, of course, get an infinite result. Therefore, the 
mean-square value which we are attempting to work out can be 
defined only for some finite time interval. If we take a unit time 
interval, then we say that the mean power per second is 


Mean of |¢(w)|? per second = E(w) (12-51) 


We can apply some of these general results to our special example of 
noise produced by a multitude of small pulses. The correlation func- 
tion for our problem is yA(r) introduced in Eq. (12-22). That is, 


A(r) = wf gg(t + r) dr (12-52) 


This means that the power function, actually called the power spec- 
trum since it is defined in terms of frequency, is 


P(w) = ufg(t)g(t + re dr dt = uly(w)|? (12-53) 
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where y(w) is the Fourier transform of our pulse function g(t). We can 
explain this simple result more directly for our problem as follows. 
If the pulses occur at times t;, so f(t) = 3 g(t — t;), the Fourier trans- 


form of f(t) is ¢(w) = N y(w)e®t:. Thus the square of ¢(w) has the 


t 


average 


(1o(w)|*) = | X, IC) eie] (12-54) 


But, since the times t; are at random and independent of ¢; for j كه‎ i, all 
the terms for 7 عد‎ j average out, because the average of e+) is zero. 
Only the terms with 7 = j remain. Each is |y(w)|?, and they are uT in 
number; so the mean of |¢(w)|? per second is u|y(w)|?. 

In the special case that the characteristic function can be approxi- 
mated by the white-noise characteristic of Eq. (12-25), the function 
A(t — ť') = const 6(¢ — t’). This means that E(w) is independent of 
w and there is the same “power” per unit frequency range [mean 
|¢(w)|2 per second] at all frequencies. 

The distributions we are describing can very conveniently be 
described by giving the probability distribution not for f(t) but for its 
Fourier transform ¢(w) directly and the characteristic functional not 
in terms of k(t) but its Fourier transform K (w). 


K(w) = fk(@)e— dt (12-55) 
Using these functions, the characteristic functional for the noise distri- 
bution corresponding to Eq. (12-43) is 

p = e*I Ew) Ow) do/2x (12-56) 
by direct substitution of the inverse of Eq. (12-55) into Eq. (12-43). 
The corresponding probability functional of Eq. (12-26) is 

P = [(ه)1/6]*|(مو|] ]افتجم‎ da 2r (12-57) 
We can deduce the result of Eq. (12-57) from Eq. (12-56) very directly 
in the following way. We note that 

eil kD S() dt — (م)و(ه)ع1]قى‎ da 2r (12-58) 


so that Eq. (12-14) implies 
P= fpe Ewo) wlr DK (w) (12-59) 


If we now imagine the possible values of w to be discrete and sepa- 
rated by an infinitesimal spacing A, the integrals in the exponent in 
Eqs. (12-56) and (12-57) can be replaced by Riemann sums, and our 
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path integral becomes 


P = [| fe-@M Rowe AKO) AK (w) (12-60) 


w 


The integral for each value of w can be done separately (by completing 
the square), and we get 

— (4/2)\¢@)|* 
Ile +% (12-61) 


wW 


Putting the product together gives Eq. (12-57). It is clear that what 
happens at one frequency is independent of what happens at another, 
and the signal strength ¢(w) at frequency w is distributed as a gaussian 
with a mean square proportional to P(w). 


BROWNIAN MOTION 


It is usually true that the path integral method does not really help 
to get the solution to problems that cannot be solved in some other 
manner. Nevertheless, someone who has followed us this far and who 
is now familiar with path integrals will find its mode of expression and 
logic very simple and direct when applied to probability problems. 

For example, in the theory of brownian motion we might have a 
linear system—say, a damped harmonic oscillator being driven by a 
fluctuating force f(t). Assume the mass of the oscillator equal to 1, 
and we must solve 


ë — yi + woz = f(t) (12-62) 


where x(t) is the coordinate of the oscillator. If the function f(t) is 
not known but is given by a known probability distribution P,[f(#], 
what is the probability distribution P.[x(¢)] for the various responses 
x(t)? Equation (12-62) relates x to f; that is, for each f(t) there is an 
x(t). Hence the probability of given xs is the same as that for the 
corresponding f’s, or 


P[x()] De) = PASO OFO (12-63) 


where z is related to f via Eq. (12-62). In general, we must be very 
careful of the relation of Dz(t) to Df(t), there being an analogue of a 
“Jacobian” between the “volume” elements. But if fand z are linearly 
related (as above), this jacobian is a constant; so if, as is usual with 


_ path integrals, we can trust ourselves to be able to normalize our 
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answer in the end, we have 
P{x(t)] = const )رط‎ — yit + woe) (12-64) 


which gives us a formal solution. If P; is gaussian, then P, is and 
the problem may be worked out in many ways, the most evident being 
by the method of Fourier series if wo? and y are independent of time. 

At any rate, many problems can be set up and solved or partly 
solved by using Eq. (12-64) as a starting point. We shall look at a 
specific example. A fast particle goes through matter in which it 
receives small, sharp alterations in velocity as a result of passage by 
nuclei. After going through a thickness T, what is the probability it 
will emerge a distance D from the origin (the extension of its original 
straight-line path) and will be moving with deflection angle @ as in 
Fig. 12-1? 

We assume that the interactions cause no measurable loss in the 
longitudinal velocity of the particle and that the matter through which 
the particle is passing is homogeneous. Further, we assume that 0 is 
always small and that the motion is the result of a large number of 
collisions each of which has a small effect. We assume that the 
expected number of collisions in the infinitesimal thickness dt is u and 
that the deflection suffered in each collision is given by the angle A, 
which is governed by the probability distribution p(A) dA. We further 
assume that this probability distribution results in a mean-square 
value of A given by 


ih © A2p(A) dd = o? (12-65) 


and we shall use the substitution uo? = R. 


Fig. 12-1 A fast particle impinges perpendicularly on a slab of matter of thickness T. 
After traveling through a thickness ¢ measured parallel to its original line of flight, it is 
deflected away from its original trajectory (as extended) by a distance x owing to a number 
of interactions with the nuclei in the material. Eventually, it emerges from the slab a 
distance D from the point O, at which it would have emerged with no deflection, and is 
traveling.in a direction that makes an angle 0 with its original direction. 
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We shall confine our attention to the motion as projected onto a two- 
dimensional plane containing the original path of the particle. Motion 
in a plane normal to this will follow similar rules, and the motion in 
either plane can be considered independently of the other. We shall 
use t to measure the depth of penetration into the slab, 6 to represent 
the instantaneous direction of motion in the plane we are considering, 
and «x to measure the position of the particle away from an extension 
of its original path of motion, as shown in Fig. 12-1. These parameters 
are related by dx = 0 dt, or ã = 6. 

We assume that the deflections of 0 occur suddenly, so that 6 = f(t), 
where the functions f are a set of randomly spaced delta functions 
having random scale heights. This means that ë = f(t) and P,f(é)] 
has the characteristic functional 


P = ]سج‎ l1—W [k(s)]} ds (12-66) 
where 
Wio) = Sp(A)e*4 dA (12-67) 


We note that the mean value of A is assumed to be 0, and these deflec- 
tions themselves are assumed small. Now if we expand G(w) as 


2 
Ww] = if p(A) (1 + twA — i qs 1 dA (12-68) 
and use terms only through second order in A to get W[w] = 1 — wo?/2, 
then 
P = e—VRS tk(s)}? ds (12-69) 


This in turn implies that 


PAS] = e-2 (® at (12-70) 
Hence 
P,{x(t)] = const exp |- only, [@(t)]? 1 )12-71( 


We wish to evaluate the probability distribution P(D,6), which 
gives the probability that the particle will exit with the displacement 
D and angle of motion 0 under the assumptions that it enters with the 
initial conditions (0) = 0 and 2)0( = 0. We are concerned not with 
the exact path that the particle takes in the material, but only that 
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the particle exits with the conditions «(7) = D and ¿(T) = 6. Thus, 
we express this probability distribution by an integral over all paths as 


PD) = | exp (- ghe at) 902:00 (12-72) 


where the paths included in the integral satisfy the assumed end-point 
conditions. This integral can be carried out by the methods of Sec. 
3-5. The integral is a gaussian and becomes an extremum for the path 


zA = 0 (12-73) 


The solution of this equation, which satisfies our assumed boundary 
conditions, is 


x(t) = (3D — 67) (x) + (6T — 2D) (x) (12-74) 


By using this path in the integrand of the exponential function in 
Eq. (12-72), we find 


1 Ts 6 OTN? 0? 
sR fe ï? dt = zr (2 = ف‎ FPT (12-75) 


which means that our required probability distribution is 


P(D,0) = const exp |- و‎ (2 — F) — IRF | (12-76) 
7 RTS 2 2RT 

In some practical cases we may really be concerned not with the 
exact linear spacing of the particle away from our assumed origin 
point but, rather, with the deflection angle at which it leaves the slab. 
Given the overall distribution function of Eq. (12-76), it is simple to 
evaluate the distribution function in angle alone by integrating over 
all values of D. The result is e-“/287), This is an expected result, 
because we have already assumed that the mean-square value of the 
deflection angle which would be acquired in a unit thickness is RF, so 
this value in a total thickness T should be RT. 

Suppose next we look only at particles which emerge traveling in a 
specific angle @ and consider the distribution function of the emerging 
positions D of those particles. We find that the probability distribu- 
tion has a maximum at D = 67/2. This would be the position we 
would expect if the final deflection angle 0 were acquired in a smooth 
manner as a linear function of thickness starting from 0 and building 
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up to its final value. In that case its average value during the passage 
through the slab would be @/2. 


Problem 12-2 Show that the constant required to normalize the 
probability function P(D,6) dD dé is 


EN ع‎ 
const = RT? RT oth 


QUANTUM MECHANICS 


In this and the following sections we should like to see how to formulate 
statistical problems in quantum mechanics. In quantum mechanics 
there are probabilities involved in an intrinsic way, because even a 
known state implies probabilities to be found in other states. . But in 
addition there may be extrinsic uncertainties. The state, for example, 
may not be known—we may know only that the state is such and such 
with a certain probability. This situation is analogous to the classical- 
mechanics situation in which the initial conditions are not known and 
only a probability distribution for such conditions is available. We 
have already dealt with such a situation in statistical mechanics 
(cf. Chap. 10), but that is a very special case in which the state of 
energy E has the probability .“الحم‎ Here we shall be more general. 

Again, under a given external potential, say V(t), the behavior of a 
quantum-mechanical system can be worked out, but what can we say 
if that potential is uncertain and has a probability distribution 
P[V()] DV (t)? Need we actually solve the problem for each V(t) 
and then average, or is there some way to formulate the problem after 
the average of V(t) is taken? (We hope so, because it often occurs 
that the solution of a statistical problem after an average is taken is, 
in fact, much easier than finding the general solution of the original 
problem for a wide range of conditions.) We shall find such a formu- 
lation in this section. Then we shall go on to discuss situations in 
which a quantum-mechanical system is disturbed not just by a classical 
system but by another quantum-mechanical system about which there 
are statistical uncertainties. 4 
- Our main purpose in this chapter is to show how these and other 
questions may be formulated. We shall not deal in detail with solving 
the special problems mentioned; they are brought up only to help us 
understand the more general formulations we shall arrive at. — 

We wish first to discuss the analogue of brownian motion for a 
quantum-mechanical system. That is, we shall suppose: that a 
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quantum-mechanical system whose unperturbed action is S(q) is 
under the influence of an external potential V(t) such that its action 
is! 


Sr(a) = Sla) + SOV (À at (12-78) 


Suppose we ask: What is the probability that, starting at some early 
time t; with the coordinate q(t;) = q;, we shall arrive at a final time t; at 
position q? It is the square of an amplitude: |K(q,,t;;q:,t:)|?. Or 
again, if we specify that initially a system be in a state of wave function 
¢(q) and finally in wave function x(q), the probability of transition 
from ¢ to x is 


P[x(4);4(4)] = WSSx*(a) K (artriti) (qs) day dail? 
= SSI Ix* 022 (IR O ir 00 $(q;) o*(q;) 
dq; dq; dq, dg, (12-79) 


It is evident that all such problems can be solved if we can evaluate 
K(q,,ts dit) K * ررارر9)‎ i,t) )12-80( 


The first factor involves the path integral feist Dq(t), whereas the 
second complex conjugate? one is fe7Sle® Dg(t). Each integral is 
over paths with appropriate end points. In writing the product of 
Eq. (12-80), we shall call the path variable in the second integral q’(¢) 
and we can then express Eq. (12-80) as the double path integral 


f Na ash 1 Dq(t) Dg (i) (12-81) 


The summing of such inkorra over various cand Saint gives the 
required probability. 

If the potential V is acting, we should replace S in Eq. (12-81) by Sy, 
and the expression becomes 


ffet Sla@l—Sta’ (IHS OVO at—Sa’ OV) dt) Daft) Dq’ (t) (12-82) 


But now suppose the potential is known only in a probabilistic sense; 
i.e., we know that there is the probability Py[V (t)] DV(@ that the 
potential is V(t). Then the probability to go from ¢ to x is given by 


1 We shall do everything as though there were only one coordinate g. . One can 
` immediately generalize to several coordinates q; (so V is a set of potentials V;) and 
to cases in which the coefficient of V(t) in the action is not simply q but some more 
complex operator. . 

2 We suppose that Slq()] i is real and that our units are so defined that ù = 1, as 
‘in Chap. 11. 
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Eq. (12-79) calculated for each V(t) and then averaged over all V(t) 
each with the weight Py[V()] DV (t). This is then 


Prob (¢ ج‎ x) = SSS Sx” (ax) (41,9739) 0(4 o*l) dq: dq; 1 0 
2-83 


where J is the average of Eq. (12-82) over all V(t) with weight 
Py[V()] DV (t); thus 


J= Sf fet SOIS O eiS a= )017 © aPy[V(t)] Dq(t) Dq' (t) DV (t) 
(12-84) 


with the integrals taken between appropriate end points q(t) = qi; 
dit) =, at) =a, V(t) = q,. Actually, this choosing of end 
points and then integrating over various values with wave-function 
distributions depending on the problem [as in Eq. (12-83)] is simply a 
sum of J’s for different end conditions, and we shall hereafter simply 
forget this and speak as though with J we already have our probability 
—it being left to the reader to remember that a bit more has yet to be 
done. This is so that we can concentrate on the main feature, the 
evaluation of the double path integrals needed to calculate J. 

In this form we can do the integral over V(t) explicitly and see that, 
to find the probabilities after averaging, we must evaluate a double 
path integral 


J = ffos ONS — q(t] Da(t) Da’) (12-85) 


where ®[k(é)] is the generating functional belonging to the probability 
distribution Py, so 


DRA] = fe Oro aP VÀ] DV À (12-86) 


Equation (12-85) then answers our challenge to express the answer 
in a form valid after the averaging. It involves evaluation of the 
double path integral. How to evaluate it is, of course, another ques- 
tion, but the methods discussed in this book may be useful. In these 
sections we are discussing only how various problems may be 
formulated. 

As an example of the application of Eq. (12-85), suppose V(t) is 
gaussian noise with zero mean and characteristic function A (t,t) as in 
Eq. (12-46). We must evaluate 


J = ff exp ¢{S[q(] — Sia’ (]} exp {—14Sfla® — oO) 
X [a(t’) — g(t) A(t’) dt dt'} Dalt) Da’ (t) (12-87) 
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Because in the new factor at least the و‎ and q’ appear only quad- 
ratically, some of the methods previously discussed for quadratic 
forms may be useful. Of course, if S(q) is itself quadratic, correspond- 
ing to a harmonic oscillator, the path integrals can be evaluated 
exactly by using the methods of Sec. 3-5. 


INFLUENCE FUNCTIONALS 


Now we wish to discuss the behavior of a quantum-mechanical system 
whose general coordinate we shall take to be q in interaction with 
another system whose coordinate we call 0.7 We shall suppose that 
all measurements which are to be made are on system q only, and no 
direct measurements of the system Q will be made. For example, we 
may be interested in how an atom makes transitions because it is in 
the electromagnetic field and can radiate. We contemplate study- 
ing only the atom and will not directly measure the light coming 
from it; then و‎ are the atomic coordinates and © the coordinates of 
the field. If we study it the other way—that is, if we only observe the 
light from the atom, emitted, absorbed, or scattered, but never ask for 
any quantity directly involving the atom’s variables—then we may 
use our present analysis with © being the atomic coordinates and q 
those of the electromagnetic field. If, for example, the theory of the 
index of refraction is wanted, then و‎ are again the field variables and 
Q the variables of the piece of matter through which the light goes. 
For one further example, suppose the behavior of an electron in a 
crystal (or an ion in a liquid) is to be studied: the measurements to be 
analyzed involve directly only the position of the charge, not the 
material of the crystal. For example, we might wish the current 
(electron velocity) generated in some circumstance, but we are not 
contemplating correlations with the number of phonons produced. 
Then q can be the coordinates of the electron and © all the other coordi- 
nates of the matter of the crystal. 

Let S[q(t)] be the action of system q, So[Q(é)] that of the environ- 
mental system alone, and S,{q(t),Q(f)] that of the interaction between 
the environmental system Q and the system of interest g. The action 
of the combined system is S[q(t)] + S [Q] + Sdq@,Q(@], and the 
probability of any event involving the combined system can be evalu- 


+ 0 stands for any number of coordinates—this other system may be, and gener- 
ally is, very complex. We shall just carry one Q variable, but nothing essential 
will be lost. 
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ated from the double path integral, an obvious generalization of Eq. 
(12-81), and now written as 


J = ff exp (i{ Sl] — Sig )0[ + Sinda), QO] 
— Sind O, O] + SolQ(0] — Sl Q’()]}) Da) DOW) Da’) DO") 
, (12-88) 


But, if we need no measurements on Q(t) and if only the dependence on 
q(t) need ever be studied, then we can write our answer in the form 


J = ffetseol-strOuF[g(t),q’()] Da) Da’ (12-89) 


where we shall call the functional F[q(t),q’(t)] the influence functional. 
It is a functional of the two functions q(t) and q(t), and for this par- 
ticular problem it is given by 


Fla), O = J ff exp GSO- SAO + Sula, QO) 
f 
- Sd, Q' OIH DQO DO (12-90) 


The sum on f means we are to sum over all possible final states of Q. 
This is because no measurement on Q is to be taken, and all final states 
of the environment are possible. Therefore we must add together the 
probabilities [i.e., the J functions of Eq. (12-88)] of all. In coordinate 
representation, for example, 2 just means that at some final time ty 


7 
after we are no longer interested in the interaction we must take 
Q(t) = Q'(t,) = Q; and integrate over all Q;. 

To summarize, the behavior of a system in any environment can be 
discussed in terms of a double path integral like Eq. (12-89), where F is 
a property of the environment—its “influence” on the system. It 
summarizes all of the environment that is relevant to q(t). Two dif- 
ferent possible surrounding conditions, say, A and B, might physically 
be very differently constructed; nevertheless, if they happen to lead 
to the same functional F, they are indistinguishable as far as the 
behavior of the q system is concerned. 

This F is somewhat analogous to the use of “external force” in 
separating the behavior of interacting systems classically. We can 
analyze the motion of q alone provided we suppose we know what force 
is produced (as a function of time) by the environment. These new- 
tonian equations of motion of g alone are the rough analogue of Eq. 
(12-89), whereas Eq. (12-90) corresponds to the calculation of the force 
produced by a given environment. Two different environments which 
produce the same force on q are equivalent. Actually, the analogy is 
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only rough; for F contains the entire effect of the environment including 
the change in behavior of the environment resulting from reaction 
with g. In the classical analogue, F would correspond to knowing not 
only what the force is as a function of time, but also what it would be 
for every possible motion q(t) of the object. The force for a given 
environmental system depends in general on the motion of q(t), of 
course, since the environmental system is affected by interaction with 
the system of interest q. 
Weare therefore led to study the properties of influence functionals. 
We shall be content to list a few such rules and give some suggestions 
on how they are arrived at. 
` Ruel 


Fiq,7O]* = Fla’ [0)و,8)‎ (12-91) 


wheré the asterisk means complex conjugate. 

Rule II. If the argument functions q(t) and q/(t) are set equal for t 
exceeding any value a, then F does not depend upon the actual values 
of g(t) fort > a. 

Rule III. If F; is the influence functional for a particular environ- 
ment 2 and we do not know what the environment actually is but know 
only that the probability of its being 7 is w;, then the effective influence 
functional (for calculating all probabilities) is 


F = J uf; (12-92) 


Rule IV. If the system q is simultaneously in interaction with two 
external systems A and B and if A and B do not interact directly with 
each other and there is no correlation between them in the initial con- 
ditions, then 


F =F4:Fsg (12-93) 


where F4 is the influence functional if A alone were interacting and 
Fz is that if B alone were interacting. 

Rule V. If the functional F can be adequately approximated by 
the form 


F = exp {ifla® — q(D]V( dt} (12-94) 


the system q is acting as though under a classical potential V(t) with 
action of interaction fg(t) V (t) dt. If itis of the form F(q,q') = ®[q(t) — 
q(t)], where %[k(t)] is any functional, the environment is equivalent 
to a classical but uncertain potential V(t). [® is the characteristic 
functional for the distribution of V(t).] 
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That rule I is true is evident directly from Eq. (12-90). This 
expression also explains rule II, but in a much more subtle way. Note 
that for any system with any definite action S.(Q) and any given 
initial state 


ilS O-O 50) D) = 1 )12-95(‏ اه 
1 


This is because the integrals and the sum over final states 3 are 
1 
equivalent to 


SE (Qrtr;Qat)K* (Qrt Qati) DQ, = 5(Q: — Qi) (12-96) 


by Eq. (4-37). Thus, if the initial wave function were ¢(Q;), we would 
multiply by ¢(Q,)¢*(Q;) as we did in Eq. (12-79) and integrate to get 


So(Q:) D*E: — Q;) dQ: dQ; = J\o(Q)|? dQ 
=1 (12-97) 


Now notice that, if we put 'و‎ )( = q(t) for a given fixed q(t) for all time 
in Eq. (12-90), we have an expression just like Eq. (12-95) where the 
effective (and definite) action is 


SQ] = SRH] + Sina, QO] 
with 
Sa[Q’(0] = SQ’) + Sina, Q H] 


as required, as long as g'( = q(t). Hence F[q(t),¢(é)] = 1. 

The same argument limited to the time range a < t < ty, using a 
relation like Eq. (12-96) but with t;, Q; replaced by a, Qa, shows that, if 
q(t) = q(t) for t > a, the dependence of F on q(t) for t > a drops 
away, because the right side of Eq. (12-96) does not depend on q(t) 
fort > a. 

Rule 111 is an evident result of the fact that probabilities are deter- 
mined by adding the value of J over various circumstances. 

Rule IV is evident from Eq. (12-90) when it is realized that the 
conditions of the rule imply that the action that goes into Eq. (12-90) 
is Soa[Qa(d)] + Sint algat), Qa] + SoslQs(0] + Sine slg), Q(t] and 
that the exponential of the sum becomes a product, as does the integral 
F, if the initial state is itself a product of wave functions. 

Rule V is merely a statement of our results shown in Eqs. (12-82) and 
(12-85). 

These are some of the general properties of influence functionals. 
Calculations with them involve the various methods for doing path 
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integrals applied to Eq. (12-89). We shall conclude this section by 
discussing certain important influence functionals. 

Just as gaussian probability distributions and gaussian noise dis- 
tributions are simple and important, so influence functionals which 
depend on q(t), q'(t) as an exponential of a quadratic form—which we 
shall call gaussian influence functionals—are particularly important. 

First, if the environment is a set of harmonic oscillators in their 
ground state (or at a given temperature) coupled linearly to the system 
of interest g, evaluation of Eq. (12-90) shows that F is gaussian. But 
gaussian influence functionals, like gaussian probabilities, occur in 
good approximation in a much wider class of situations, namely, where 
the effect is the result of a very large number of influences, each of 
which by itself has little effect.. For example, consider an atom in 
weak interaction with each of the large number of atoms of an environ- 
mental gas. The influence of one atom A is very small, so its influence 
functional F 4 differs only slightly from 1. However, in view of rule IV, 
the complete F is the product of many such factors, which becomes 
(nearly) the exponential of the sum of a small contribution from each. 
This contribution expanded to first and second order in the interaction 
with each atom leads to influence functionals of the gaussian type. 

As an application of this conclusion, a piece of metal placed in a 
cavity resonator affects the resonator in a simple linear way summariz- 
able by one impedance function, even though the multitude of elec- 
trons in the metal behave in such a complex manner. The influence 
functional of the metal (Q) on the cavity oscillator (q) is nearly a 
gaussian, and to this extent the metal is equivalent to some set of 
harmonic oscillators which would produce the same influence 
functional. 

The most general exponential functional involving q(t) and q(t) in 
linear form is 


F[q(),q’'(] = exp [SOV (H at — if OU di) (12-98) 


for arbitrary and complex V(t) and U(t). If this is to be an influence 
functional, however, it must satisfy the conditions of our five rules. 
Rule I requires U(t) = V*(é), and rule II implies U(t) = V(t); hence 
U and V are equal and V is real. Thus the most general linear func- 
tional is that equivalent to the action of a classical external potential 
in accordance with rule V. 

We need not discuss this simple case further; for it is completely 
analyzable just by adding q(t) V(t) to the hamiltonian of the unper- 
turbed problem. If the exponent has both quadratic and linear terms, 
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the linear term can be factored out, so via rule IV we can say it is a 
classical potential plus the effect of a purely quadratic functional. 

The most general exponential functional which involves its argu- 
ments purely quadratically is of the form 


Fla,q@) = exp ! - ff EGO) + 6G HOY) 
+ VOOL) + COIA at} (12-99) 


for arbitrary and complex! a, 8, y, and ô. The integrals on t’, t are over 
the entire interesting range of time, but we always take t > t. This 
is no loss of generality, of course, but it is convenient for later analysis. 
For this to be a satisfactory influence functional, we must have from 


Rule I 

BEL) = a* (t,t) (12-100) 
and 

y(t) = *ة‎ (t) (12-101) 


Rule II gives us a great deal of information, for putting q(é) = q'(¢) 
for t > a and, assuming ¢ > a, t < a, the expression [which is part of 
the integral in Eq. (12-99)] 


J, [E REO) + LEOA) 
+ VDOT E) + LOdE (12-102) 


must be independent of q(t) for t > a and arbitrary q(t’) and of q(t) 
for t’ < a. This requires that 


ôt, t) = —a(t,t’) 
y(t) = =L) 


as long ast > a, t <a. But since ais arbitrary, Eqs. (12-103) must 
hold for all t, t' (under the continuing restriction i > t’). 

Therefore, the most general gaussian influence functional depends 
on only one complex function a(t,t’) and is of the form 


exp {- ji 1 UO — PONOA) — "و‎ a*t] at at} (12-104) 


In the case that a(t,t’) is real, say, A(t,t’) our functional is equivalent 
to the exponential of Eq. (12-87), and we have the equivalent of a 
noisy classical perturbation. In general, in quantum-mechanical 


(12-103) 


1 These functions are defined only for t > ?’. 
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systems a is complex. A special case of importance is that 


a(t) = a(t — t’) 


depends only on the time difference t — t. We are then dealing with 
an environmental system which has average properties independent of 
absolute time. 


To help understand some of the properties of Eq. (12-104), we shall 
ask for the probability that the و‎ system makes a transition from 
energy level n to some other orthogonal level m during an interval of 
time T in the case that a is very small and we can use perturbation 
theory. If we expand F in Eq. (12-104), the leading term, 1, gives 
nothing because the states are orthogonal. The next term linear in a 
has four pieces. One is - f 1 ° u(t)q(Dq(t) at’ dt. When this is 
substituted into Eq. (12-89) in place of F and this evaluated as in Eq. 
(12-83) with ¢ = ,ف‎ and x = ,رف‎ the integral on Dq(t) and Dg'(t) is 
seen to be the product of two factors. One, the integral on q, involves 


| esa | - [f eOe) ar at] do 
which is the transition element (cf. Chap. 4) 
w— ff ODE) at at), 
= — ff‘ nla@alt))n a(t) dt’ dt (12-105) 


The integral on Dg is just Jel Dg’ and is the complex conjugate of 
the transition element m({1)n. Analyzing the other terms in a similar 
way, the total transition probability is 

t 
P(n— m) = ff" [—a(tt) aO) 1(5)م‎ 


— a* (i,t) n(n m(g(t)a(t’) me + a*t) mI) a 7 
+ altt’) m(q())n m(G(t’))n] dt’ dt (12-106) 


If m and n are orthogonal, (1), = 0. If S[g] comes from a constant 
hamiltonian with energy levels Hz for states k, then 


(TD) dani Fat (12-107) 


Only the last two terms of Eq. (12-106) survive, and they are complex 
conjugates of each other, so that 


P(n جب‎ m) = 2 2. / / 1 a(t,t ei EE) dt! dt (12-108) 
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Problem 12-3 For m =n, verify P(m— m) ع‎ 1 — Pin n) 
as required by conservation of probability. 


In the case of a time-steady environment a(t,t’) = a(t — t’). Sup- 
pose we define the Fourier transform 


AO i e(r)e-™ dr (12-109) 


[a(t) is not defined for t < 0.] Then since P(n ج‎ m) in Eq. (12-108) is 
proportional to the time interval over which the integrals extend, 
we can define a rate of transition per second and find the probability 
of transition 


P(n — m) per second = 2ar(Em — En)|Qnm|? (12-110) 
where we break a(v) into real and imaginary parts 
a(v) = ar(v) + iar(v) (12-111) 


We may note that, for a disturbance by a classical potential under 
gaussian noise, a(r) is real [cf. Eq. (12-87)] and the real part of a(v) 
is the power-spectrum function of the noise as defined in Eq. (12-32). 
So, for such classical noise systems 


ar(v) = ar(—v) (12-112) 
and in first-order perturbation 
Rate of transition n =» m = rate of transition m —> n (12-113) 


and both rates are proportional to the power P(w) at the frequency of 
the transition. Thus classical potentials have equal probability of 
causing transitions up and down. 

Another interesting example is when the environment cannot supply 
energy with any reasonable probability. For example, it may be 
initially in the ground state or at zero temperature. We shall call 
such an environment “cold.” For such a situation transitions of the 
system و‎ going up in energy (Em > En) are unlikely. Hence for such 
cold-environment systems 


ar(v) = 0 for « > 0 )12-114( 
and for first-order perturbations 


Rate of transition n > m = 0 11 Hn >A )12-115( 
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Since any a(v) can be written as the sum of one of the type shown in 
Eq. (12-112) plus one of the type shown in Eq. (12-114), it is readily 
apparent that any time-independent gaussian functional is equivalent 
to a system in some cold environment acted on by a fluctuating classi- 
cal potential described by a gaussian expression. This conclusion fol- 
lows from the fact that the product of any two gaussian functions is 
also a gaussian and from rule IV. If the interaction of one environ- 
ment on the system is represented by A,(¢,t’) in the manner of Kq. 
(12-87) and the interaction of the other environment as A2(t,t’), then 
the single interaction term in the single resulting gaussian functional 
is 41 + A>. 


INFLUENCE FUNCTIONAL FROM A HARMONIC OSCILLATOR 


We shall next give an example of how F can be worked out from Eq. 
(12-90) for an environment consisting of a harmonic oscillator with 
coordinates ©, in the ground state, coupled to q linearly with an inter- 


action Sin(g,Q) = CSa Qt) dt. We suppose the oscillator of unit 
mass and frequency w, so that 


So(Q) = SIO0)? + QA] dt (12-116) 
Then 


Fla,q'O1 =) || exp {i f 2400? + 7 


+ CaO QH] at} exp {—i f 240? + 14029"? 
+ CPOO at} DRO DQ’ (12-117) 


where m is the final state and the initial state is the ground state. 
The integral on Q is clearly gaussian, and in fact we have already done 
it; for it is exactly the transition amplitude Gmo worked out in Sec. 8-9 
for a forced harmonic oscillator. The forcing function there called 
y(t) is here Cq(t).t It is therefore given by Eq. (8-145) with n = 0 or 


Gmo = (m!)—*(78*) "Goo (12-118) 
t The reader may prefer to observe that Eq. (12-117) is 


Fla(D,q'(D] = f dQ, K (Qy,t7;Q:,ti) K * Qrt) 60(Qi) 69 (Q) dQ: dQ; 


where K is the kernel of Eq. (3-66) for a forced harmonic oscillator with f(t) = Cq(é) 
and K’ is that with f(t) = Cq’(t). p(Q) is the wave function of the oscillator in 
the ground state. All variables Q;, Q;, and Q; then appear in a simple gaussian 
way and may be directly integrated. We shall then find it as easy to do the finite- 
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with Goo given in Eq. (8-138) and 8* in Eq. (8-143) replacing y by 
Cq(t). Likewise, the integral over Q’ is the complex conjugate of a 
similar expression but with y(t) replaced by Cq’(t) this time. We 
distinguish values with this substitution with a prime. Then the sum 
on final states in Eq. (12-117) gives us 


F(q,q') = 2 Gaiaa = » (m!)—* (48 *) "Go0(m!)—4(— 78’) "Goo 
= Goof (12-119) 


Substitution from Eqs. (8-138) and (8-143) produces, as expected, an 
F of the form of Eq. (12-104) but with 


2 
a(t) = = e~iw(t-"’) (12-120) 


For example, the terms in “وو‎ in Eq. (12-104) come directly from the 
6*6“ in the exponential; for this product by Eq. (8-143) is 


ra Lf aoe at] [J roe at] = E ff" noro 
+ q'(a(t')e#-)] dé! dt (12-121) 


The quantity a(v) defined in Eq. (12-109) is therefore [cf. Eq. (5-17) 
and the Appendix] 


e —iwte—ir ite . 1 
a(v) = = | e~ivte—ivt dt = o | PP. a ws + rôļlw + »| 
(12-122) 
so that the real part of a(v) is 
2 
ar() = = lo + ») (12-123) 


This is zero for positive v». As expected, we have a “cold environment” 
as specified in Eq. (12-114). 

Se ee S eee 
temperature case. For here state n is the initial state with probability propor- 


tional to 6-825, go, in view of rule III, the resulting F is obtained by the expression 
above but with the wave functions $0(Q;) $3 (Qi) replaced by 


const J, (Qi) 6% (0)08 


that is, by the density matrix p(Q;,Q;) worked out in Prob. 10-1. The integrations 
are again gaussian. 
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If many independent oscillators of different frequencies are all acting, 
then by rule IV, their ar(v) functions add; so any cold system (to 
this gaussian approximation) is equivalent to a continuum of oscillators 
in their ground state. This follows, since any function az(v), for 
negative v, can be built up of ô functions of the form of Eq. (12-123). 

Another interesting example is the interaction with an oscillator 
at finite temperature. If the temperature is T, the initial state is state 
n with relative probability e~#-/*7, For our case, the absolute proba- 
bility is 


Wn = EmMulkT(] — e-tw/k) (12-124) 
If the initial state were n, the influence functional would be 


Fn = Y Gunaan (12-125) 


instead of the form in Eq. (12-119). Using rule III, we add these 
with probabilities wn, so our final F is 


F = Y Garg, elt )1 — etalk (12-126) 


The sum is difficult to work out directly from Eq. (8-145), but it is 


pee * _ Ql 
F = 00,0067 exp | n cpe] (12-127) 
The ar(v) that results from this in place of Eq. (12-123) is 
C2 feo [kT 1 
ar(v) = Ae | ie 8) 34 v) -+ gholkT — 1 8) = »| (12-128) 


and sums of such expressions of many oscillators constitute the environ- 
ment. Now transitions can go down in energy (w > 0) or up in energy. 

We note that if v > 0, the first ô function fails, whereas if v < 0, the 
second fails, and that indeed 


ar(—|r|) = e*/*Ta 2(+|p}) (12-129) 


This definite relation means that in perturbation theory, if En > En, 


Probability of a transition per second up (m >n) _ e-(En—Em) [kT 
Probability of a transition per second down (n — m) 
(12-130) 
by using Eq. (12-110). 
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Thus, if the system و‎ occupies states n with relative probabilities 
,امدق‎ the net number of up and down transitions will balance out 
and the system will be in statistical equilibrium for weak perturbation 
with the environment. This is just what we expect from the laws of 
statistical equilibrium. Any environment at temperature T producing 
a quadratic influence functional will have the property of Eq. (12-129). 

For an atom as system q in interaction with the electromagnetic 
field at temperature T as the environment, ar(v) is given by an expres- 
sion like Eq. (12-128) integrated over all the modes of the field of 
various frequencies w. It can be split into the cold environment of 
Eq. (12-123) plus a noisy external potential: 


2 1 2 
ar(v) = fe ôlw + v) yS + v) + ôlw — [(س‎ 


(12-131) 


The first term produces only transitions down in energy and is called 
spontaneous emission. The second produces transitions up and down 
with equal ease and is called induced emission, or induced absorption. 
We say that the transition is induced by an external potential or noise 
whose mean-square strength at frequency » varies with temperature 
as 1/(e*”/kT — 1). This is the way Einstein first discussed the black- 
body-radiation laws. As we see here, any environment giving a 
quadratic influence functional at temperature T (we say it is an 
environment responding linearly) can be treated in the same way. 
Many people have extended Einstein’s argument to other systems, 
like the voltage fluctuational noise in a resistor at temperature T. 
The first term measures the rate at which energy is taken out of our 
system g in a one-way manner. It measures the amount of “dissipa- 
tion” produced by the environment (e.g., electrical resistance of a 
metal or radiation resistance of the electromagnetic field). At tem- 
perature T we can then say that things behave as if, in addition to the 
dissipation, there is a noisy signal generated by the environment whose 
mean square at each frequency is proportional to the dissipation at 
that frequency and to (e*”/kT — 1)-1, This is called the disstpation- 
fluctuation theorem. 
We cannot pursue this subject further here.} 


1 The subject of influence functionals is discussed in detail by R. P. Feynman and 
F. L. Vernon, Jr., Ann. Phys. (N.Y.), vol. 24, p. 118, 1963, and W. H. Wells, Ann. 
Phys. (N.Y.), vol. 12, p. 1, 1961. An application to calculation of mobility of the 
polaron is in R. Feynman, R. W. Hellwarth, C. K. Iddings, and P. M. Platzmann, 
Phys. Rev., vol. 127, p. 1004, 1962. 
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12-10 CONCLUSIONS 


In these applications of path integrals to probability theory it is evident 
that, if the integrands are gaussian, we can make considerable use of 
the technique. But these problems are precisely those for which 
other methods, not requiring path integrals, are also available to solve 
the problem. One may reasonably question the real utility of the 
path integrals. We can only say that if the problem is not gaussian, 
it can at least be formulated and studied by using path integrals— 
and that we might hope that someday, when the techniques of analysis 
improve, more can be done with it. The only example of a result 
obtained with path integrals which cannot be obtained in simple 
manner by more conventional methods is the variational principle 
discussed in Chap. 11. We hope that further study of these methods 
may yield more such results. 

In the meantime, however, it is worth pointing out that the path 
integral method does permit a rapid passage from one formulation 
of a problem to another and often gives a clear or quick suggestion of a 
relation which can thea be more slowly derived in a more ordinary 
fashion. 

With regard to application to quantum mechanics, path integrals 
suffer most grievously from a serious defect. They do not permit a 
discussion of spin operators or other such operators in a simple and 
lucid way. They find their greatest use in systems for which coordi- 
nates and their conjugate momenta are adequate. Nevertheless, spin 
is a simple and vital part of real quantum-mechanical systems. It is 
a serious limitation that the half-integral spin of the electron does not 
find a simple and ready representation. It can be handled if the 
amplitudes and quantities are considered as quaternions instead of 
ordinary complex numbers, but the lack of commutativity of such 
numbers is a serious complication. 

Nevertheless, many of the results and formulations of path integrals 
can be reexpressed by another mathematical system, a kind of ordered 
operator calculus.! In this form many of the results of the preceding 
chapters find an analogous but more general representation (only for 
the special problems of Chap. 11 is the generalization not known) 
involving noncommuting variables. For example, in this chapter dis- 
cussing influence functionals it must have struck the reader that an 
environment coupled not to the coordinate g but to a noncommuting 


1R. P. Feynman, An Operator Calculus Having Applications in Quantum 
Electrodynamics, Phys. Rev., vol. 84, p. 108, 1951. 
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operator, such as the spin, would be an important and interesting 
generalization. Such things cannot be conveniently expressed in the 
path integral formulation but can be easily expressed in the closely 
related. operator calculus. 

An effort to extend the path integral approach beyond its present 
limits continues to be a worthwhile pursuit; for the greatest value of 
this technique remains in spite of its limitations, i.e., the assistance 
which it gives one’s intuition in bringing together physical insight and 
mathematical analysis. 
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Correlation function, 333-334 

Coulomb potential, 137, 239 

Coupling coefficient, 70-71 

Cross section, 135-139, 152 

Crystal, 212-217 
polar, slow electrons in, 310-320 

Current density, 237 

Cut-off rule, 245-246, 258 , 


d-c voltage, 328-329 
Degenerate states, 160 
Delta function, 53, 87, 281 
Density of levels, 151-152 
Density matrix, 273-277 
Diffraction, electron, 140 

of light, 11-13 

of particles, 47-57 
Dipole approximation, 250 
Dirac, Paul A. M., 23, 255 
Dirac delta function, 53, 103 
Dirac equation, 36 
Dispersion, 225 
Dissipation-fluctuation theorem, 354 


Effective width of diffracting slit, 
49-52 

EHigenfunctions, 145 

of operator, 115 
Eigenvalue for operator, 115 
Elastic collisions, 130 
Elasticity, 220 
Electric dipole, 250 
Electromagnetic field, 39, 189-191 
Electromagnetic mass correction, 254 
Electron spin, 5 
Electrons in metal, 293 
Emission, 153, 247-250, 260-262 
Energy, 28, 46-47, 54, 269 

correction, 253-255 

expansion of kernel, 116-117 

and frequency, 46-47 

in wave functions, 84-86 

levels, 248 

rest, 256 
Energy shift, first-order, 159-161 
Energy-time transformation, 102-105 
Entropy, 272 

of mixing, 291 
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Absorption, 153 
Acceleration, 174 
Action, viii, 26-28 
Amplitude, 5-9, 13-16, 19-21 
, Scattering, 142 
Annihilation, 232 
Approximation of continuity”, 218-222 
Atomic potential, 134-141 
Average, of exponential, 283 
of measured quantity, 112-113 
weighted, of potential, 305 


Bethe, H., 256 

Black-body Radiation Law, Planck, 
294-295 

Boltzmann’s constant, 268 

Born approximation, 128, 131, 169 

Born expansion, 128 

Bose particles, 16-17, 231, 244, 288 

Boson (see Bose particles) 

Boundary conditions, periodic, 92-94, 
213-214 

Box, normatizing wave functions in, 
89-92 

Brownian motion, 165-166, 336-340 


Calculus of variations, 26-28 

Central-limit theorem, 332 

Characteristic function, 325 

of measurement, 106-112 

Characteristic functional, 326 

Charge density, 237 

Chemical constant, 279 

Classical path, 59-60 

Classical trajectory, 26-28 

Cold environment, 350 

Communications, theory of, 329 

Commutation laws, 176 

Commutation between operators, 
115-116 

Complex conjugate of wave function, 
108-109 

Compton wavelength, 245 

Configuration integral, 278 

` Conservation of probability, 82-84 

Constant field, 63 

Continuum, 150-152 

Coordinate space compared to 
momentum space, 101-105 


Harmonic oscillator, 63, 65, 71-73, 
183, 198-234, 276, 286, 351-354 
damped, 336 
forced, 64, 181, 233-234 
Harmonic oscillators, independent, 
209-211 
Heat, 271 
Heisenberg, Werner, 9, 161 
Helmholtz free energy, 268 
Hermite polynomials, 199-200, 203 
Hermitian operator, 114 
Hermitian property, 82, 86, 188 


Independent systems, 67-68 
Influence functionals, 343-354 
Integral equation, 126, 129, 146 
Interaction, 125, 130 

of field and matter, 247-253 

particle and oscillator, 169 

of systems, 66 
Interference, 5-8 

pattern, 56 
Interfering alternatives, 13-14 
Inverse transformation, 111-112 


Jacobian, J., 72-73 


Kernel, 26, 28-29, 32-36 
Kinetic-energy term, 72, 178-179 
Kronecker delta, 86, 208 


Lagrangian, 26-28, 42 
general quadratic, 58 
Lamb, Sir Horace, 253 
Lamb shift, 253, 256-260 
Laplace, 2 
Lattice vibrations, 312 
Least action, principle of, 26-28 
Lee, T., 318-319 
Lifetime, 161 
Linear operator, 114 
Liquid helium, 287-293 


Environment, cold, 352-353 
time-steady, 350 

Environmental system, 343 

Exclusive alternatives, 14 

Expectation value, 217 

Expected value for measurement, 

112-115 
Extremum, 59-60, 62 


Fermi, Enrico, 16 
Fermi field, 231 
Fermi particles, 16-17, 244, 288-292 
Fermion (see Fermi particles) 
Fine-structure constant, 252 
Force, 175, 270 
external, 344 
Forced harmonic oscillator, 70-71 
Form factor, 138 
Four-dimensional symmetry, 237 
Fourier series, evaluating path inte- 
grals by, 71-73 
Fourier transform, 153, 219-220, 226, 
314, 325, 335 
as energy-time transformation, 105 
Free energy, approximate, 285 
Free particle, 42-47, 120 
relativistic, 34-36 
Frequency, 54 
and energy, 46-47 
of wave function, 84-86 
Fresnel integrals, 49, 55-57 
Frohlich, H., 310n. 
Functional, path integral as, 68-69 
Functional derivatives, 170-173 


Gaussian function, 49, 52 
Gaussian integral, 42-43, 58-62 
Gaussian slit for diffraction, 49-54 
Geiger counter, 3 

Gravitational effects, 246 

Ground state, 222-224 


Haga, E., 318-319 
Hamiltonian, constant, 147 
time-independent, 84 
in vector potential field, 191 
Hamiltonian operator, 79-82 
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Orthogonal states, 86 

Orthonormal functions, 86 

Oscillator, field as, 69 
interacting, 205 


Partition function, 269-273, 280 
for symmetric statistics, 291 
Path, 20-21 
Path integral, 34 
double, 341-342 
(See also specific problem) 
Pauli, Wolfgang, 16 
Pekar, S. E., 3187. 
Periodic boundary conditions, 92-94, 
213-214 
Periodic paths, 284 


. Permutation, 288 


Perturbation expansion, 120-127 
first-order, 252-253 
time-dependent, 143-161 

Perturbed potential, 125 

Phase, change of, 45 
of path, 28 

Phonons, 222 

Photoelectric effect, 153 

Photon, 231, 243 

Pines, D., 318-319 

Planck, M., 243 

Planck Black-body Radiation Law, 

294-295 

Planck’s constant, 9 

Plesset, M. S., 230n. 

Poisson distribution, 322 

Polar crystal, slow electrons in, 310- 

320 

Polarization, 224 
oscillations of, 312 

Polaron, 310 

Position measurements, 96 

Positrons, 39 

Potential, effective, 285 
and momentum representation, 102 

Potential-energy term, 72 

Potential function, series expansion, 

62-63 

Power, 153 
mean, 334 
spectrum, 334-336 

Pressure, 270 


Magnetic analyzer, 45 
field, 64 
analyzer, 100 
and Schroedinger equation, 79-80 
Mass, experimental, 257 
Matrix element, 145-161 
for transition, 151-161 
Maxwell, James C., 23 
Maxwell’s equations, 23, 230, 237-240 
least-action principle for, 240-242 
Mean lifetime, 161 
Measurement and probability, 106- 
112 
Measuring equipment and momen- 
tum, 100 
Meson, 231, 254 
Minimum principle, 302-303 
Molecule, diatomic, 139 
polyatomic, 140, 203-208 
Moment-generating function, 325 
Momentum, 8-11, 28, 44-45, 184-189 
amplitude, 96-102 
functional, 187 
measurement, 96 
operator, 116, 185-186 
of photon, 248 
probability, 97-100 
representation, 185 
space, 101-105 


Newton’s law, 175, 251 
Noise, 326-336 
classical, 350 
correlation function of, 334 
gaussian, 329, 331-336 
Nondegenerate states, 160 
Nonrelativistic approximation, 237 
Normal modes, 206-208 
coordinates, 208-212, 216-217 
distribution, 50 
Normalization factors, 178 
of wave functions, 89-94 
Normalizing constant, 33, 61, 194, 275 


Operator calculus, 113 

Operator notation, 80-82, 170, 184- 
189 

Operators (see Operator notation) 


Sound, longitudinal, 225-229 
speed of, 222 
Spherical coordinates, 173 
Spin, 5, 15-16, 39, 231, 264 
Standard deviation, 50-52 
State of system, 57 ' 
Stationary phase, method of, 132 
Statistical problems in quantum 
mechanics, 340-342, 350-351 
Steady states, 84-89 
Succession, events occurring in, 36-38 
of velocities, 189 
Sum over paths, 29, 31-34 
(See also specific problem) ` 
Superconductivity, 293 
Symmetry of crystals, 224 


Temperature, critical, 292 
finite, 353-354 
Thermal equilibrium, 269 
Third-order term, perturbation ex- 
pansion, 158 
Time-of-flight as momentum measure, 
96-97 
Trace, 273 
Transformation, of coordinates, 68 
energy-time, 102-105 
Transformation functions, 110-112 
Transition amplitude, 109, 143-149, 
165, 202 
for hamiltonian, 192-195 
Transition element, 164-195, 349 
of functional, 168-195 
product of two positions, 181 
Transition probability, 248-250 
Traveling wave, 222 
Trial action, 308 


Uncertainty principle, 9-13, 52-54, 85 
Unit cell, 224 


Vacuum energy, 244-246 
Variational method, 296, 303-307 
Vector potential, 79-80 - 

as perturbation, 189-191 
Velocity, mean-square, 177-179 
Virtual state, 157-158 


Principal part of integral, 103-104, 
156-157 
Probability, 2-9, 19, 134 : 
amplitude, 5-9, 13-16, 19-21 
of function, 323 
and measurement, 108-110 
relative, 43, 51 


- of transition, 148-152 


Product, of kernels, 38 
of positions, 181 
Pulses, random, 322-324 


Quadratic actions, 182-184 
approximation, 204 
functional, 184 
Quantized field, 69, 229 
Quantum electrodynamics, 65, 69, 
236-265 
Quantum field theory, 229-232 


Radiation field, 242 

Rayleigh-Ritz method, 307-310 

Relativistic particle, 35-36 

Relativistic quantum mechanics, 102- 
103 

Relativistic symmetry, 259 

Relativistic theory, 38-39, 139 

Resonance, 161 

Root-mean-square (rms) deviation, 
52-3 

Rutherford, E., 253 

Rutherford cross section, 137 


Scalar potentials, 79-80 
Scattering, 14-19, 122-127, 129-143, 
151 
multiple, 130 
second-order, 147 
x-ray, 138 
Schroedinger, Erwin, 22-23 
Schroedinger equation, 22-23, 58, 79- 
81, 129, 195 
Schultz, T. D., 319 
Schwinger, J., 173, 258 
Second-order term, perturbation ex- 
pansion, 154-158 
Separable systems, 66-68 
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Wavelength and momentum, 44-45, 99 
Waves, retarded, 251 
running, 242-243 
transverse, 225, 238 
Weighted averages, 166 
Weisskopf, V. F., 256 
White noise, 331 


X-rays, scattering of, 138 


Voltage, fluctuations of, 329 
Volume as normalizing constant, 
93-94 


Wave, scattered, 142 

Wave function, 57-58, 77 
complex conjugate of, 108-109 
expansion for, 127-129 

Wave number, 54 
and momentum, 45, 47 
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